CA125 GENE AND ITS USE FOR DIAGNOSTIC AND THERAPEUTIC INTERVENTIONS



BACKGROUND OF THE INVENTION 

[0001] This application is a continuation-in-part of U.S. Provisional Application Serial No.
60/284,175 filed April 17, 2001, U.S. Provisional Application Serial No. 60/299,380 filed June
19, 2001, U.S. Non-Provisional Application No. 09/965,738 filed September 27, 2001, and U.S.
Provisional Application Serial No. 60/345,180. This application is also a continuation-in-part of
provisional application 60/427,045 (filed November 15, 2002). All of these applications are hereby
specifically incorporated by reference. Applicants hereby specifically claim the benefit of these
prior filed applications under 35 U.S.C. §§ 119(e) and 120.

[0002] The present invention relates generally to the cloning, identification, and expression
of the CA125 gene's glycosylated amino terminal domain, the multiple repeat domain, and the
carboxy terminal domain in vitro and, more specifically, to the use of recombinant CA125 with
epitope binding sites for diagnostic and therapeutic purposes. Additionally, the genomic DNA, a
molecule encoding a 5' upstream region of CA125, and a genomic DNA sequence for the amino
terminal, extracellular repeats and carboxy terminal of CA125 have been determined.

[0003] CA125 is an antigenic determinant located on the surface of ovarian carcinoma cells
with essentially no expression in normal adult ovarian tissue. Elevated in the sera of patients
with ovarian adenocarcinoma, CA125 has played a critical role for more than 15 years in the
management of these patients relative to their response to therapy and also as an indicator of
recurrent disease.

[0004] It is well established that CA125 is not uniquely expressed in ovarian carcinoma, but
is also found in both normal secretory tissues and other carcinomas (i.e., pancreas, liver, colon)
[Hardardottir H et al., Distribution of CA125 in embryonic tissue and adult derivatives of the
fetal periderm, Am J Obstet Gynecol 163;6(1):1925-1931 (1990); Zurawski VR et al., Tissue
distribution and characteristics of the CA125 antigen, Cancer Rev. 11-12:102-108 (1988);
O'Brien TJ et al., CA125 antigen in human amniotic fluid and fetal membranes, Am J Obstet
Gynecol 155:50-55 (1986); and Nap M et al., Immunohistochemical characterization of 22
monoclonal antibodies against the CA125 antigen: 2nd report from the ISOBM TD-1 workshop,
Tumor Biology 17:325-332 (1996)]. Notwithstanding, CA125 correlates directly with the
disease status of affected patients (i.e., progression, regression, and no change), and has become
the "gold standard" for monitoring patients with ovarian carcinoma [Bast RC et al., A
radioimmunoassay using a monoclonal antibody to monitor the course of epithelial ovarian
cancer, N Engl J Med 309:883-887 (1983); and Bon GC et al., Serum tumor marker
immunoassays in gynecologic oncology: Establishment of reference values, Am J Obstet
Gynecol 174:107-114 (1996)]. CA125 is especially useful in post-menopausal patients where
endometrial tissue has become atrophic and, as a result, is not a major source of normal
circulating CA125.

[0005] During the mid 1980's, the inventor of the present invention and others developed
M11, a monoclonal antibody to CA125. M11 binds to a dominant epitope on the repeat structure
of the CA125 molecule [O'Brien TJ et al., New monoclonal antibodies identify the glycoprotein
carrying the CA125 epitope, Am J Obstet Gynecol 165:1857-64 (1991)]. More recently, the
inventor and others developed a purification and stabilization scheme for CA125, which allows
for the accumulation of highly purified high molecular weight CA125 [O'Brien TJ et al., More
than 15 years of CA125: What is known about the antigen, its structure and its function, Int J
Biological Markers 13(4):188-195 (1998)].

[0006] Considerable progress has been made over the years to further characterize the
CA125 molecule, its structure and its function. The CA125 molecule is a high molecular weight
glycoprotein with a predominance of O-linked sugar side chains. The native molecule exists as a
very large complex (~2-5 million daltons). The complex appears to be composed of an
epitope-containing CA125 molecule and binding proteins which carry no CA125 epitopes. The CA125
molecule is heterogeneous in both size and charge, most likely due to continuous deglycosylation
of the side chains during its life-span in bodily fluids. The core CA125 subunit is in excess of
200,000 daltons, and retains the capacity to bind both OC125 and M11 class antibodies.

[0007] Despite the advances in detection and quantitation of serum tumor markers like
CA125, the majority of ovarian cancer patients are still diagnosed at an advanced stage of the
disease, Stage III or IV. Further, the management of patients' responses to treatment and the
detection of disease recurrence remain major problems. There thus remains a need to
significantly improve and standardize current CA125 assay systems. Further, the development
of an early indicator of risk of ovarian cancer will provide a useful tool for early diagnosis and
improved prognosis.

SUMMARY OF THE INVENTION 

[0008] Thus, it is an object of the present invention to provide a recombinant CA125 cDNA
molecule which can be introduced into animals or human cells to achieve transcription or
expression of the cDNA. The utility of knowing the DNA sequence of a specific gene is that a
recombinant protein can be produced which can be used as an easily renewable source of that
gene or a portion of the gene. Producing and purifying recombinant protein is easier and can
produce greater quantities of protein than purifying native protein from patients. The
recombinant protein can then be used to produce antibodies to the gene, both polyclonal and
monoclonal. The recombinant protein can also be used as a positive control in test kits and
experiments.

[0009] The genomic sequence for CA125 and a 5' upstream region have been determined. A
DNA sequence showing the 5' upstream region and the amino terminal portion of the CA125
molecule is set out in Table 1 and SEQ ID NO: 1. The extracellular amino terminal domain is
made of exons: Exon 1 from 2205-11679; Exon 2 from 13464-13570; Exon 3 from 16177-34419;
Exon 4 from 34575-38024; Exon 5 from 38689-38800; Exon 6 from 40578-45257; Exon 7 from
47360-47395; Exon 8 from 52407-52442; Exon 9 from 52686-52744 as set out in SEQ
ID NO: 1. A DNA sequence showing the extracellular repeat portion of the CA125 molecule is
set out in Table 2 and SEQ ID NO: 2. The genomic repeats are made of exons: Exon R1 from
1-130; Exon R2 from 442-510; Exon R3 from 5479-5652; Exon R4 from 6301-6334; Exon R5
from 6593-6657; Exon R1 from 7558-7683; Exon R2 from 8216-8284; Exon R3 from 8877-9050;
Exon R4 from 9380-9413; Exon R5 from 9675-9739; Exon R1 from 10201-10291; Exon
R2 from 10524-10592; Exon R3 from 11200-11373; Exon R4 from 11722-11755; Exon R5 from
12016-12036; Exon R1 from 12169-12295; Exon R2 from 12532-12600; Exon R3 from 13219-13392;
Exon R4 from 13723-13756; Exon R5 from 14016-14077; Exon R1 from 15001-15126;
Exon R2 from 15367-15435; Exon R1 from 15648-15773; Exon R2 from 16002-16070; Exon
R3 from 16653-16826; Exon R4 from 17158-17191; Exon R5 from 17453-17517; Exon R1 from
18532-18657; Exon R2 from 18888-18956; Exon R3 from 19633-19806; Exon R4 from 20141-20176;
Exon R5 from 20387-20449; Exon R1 from 21609-21731; Exon R2 from 21940-22008;
Exon R3 from 22605-22778; Exon R4 from 23109-23142; Exon R1 from 29046-29168; Exon
R2 from 29266-29334; Exon R3 from 33917-34090; Exon R4 from 36702-36734; Exon R5 from
38270-38320; Exon R1 from 39104-39224; Exon R2 from 39315-39383; Exon R3 from 39532-39705;
Exon R4 from 41862-41992. A DNA sequence showing the carboxy terminal domain of
the CA125 molecule is set out in Table 3 and SEQ ID NO: 3. The carboxy terminal portion is
made of exons: Exon C1 from 1-66; Exon C2 from 1802-1947; Exon C3 from 4198-4350; Exon
C4 from 4679-4747; Exon C5 from 6811-6978; Exon C6 from 11232-11270; Exon C7 from
11594-11677; Exon C8 from 14095-14187 as set out in SEQ ID NO: 3. A full length cDNA
molecule for CA125 is set out in Table 4 and SEQ ID NO: 4. A CA125 protein is set out in
Table 5 and SEQ ID NO: 5.
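
By way of illustration only, the exon coordinates listed for the carboxy terminal domain can be converted into exon and intervening-sequence lengths. The following Python sketch assumes that the coordinates are 1-based and inclusive, which the text does not state; the function and variable names are hypothetical and form no part of the disclosure.

```python
# Exon coordinates of the carboxy terminal domain as listed for SEQ ID NO: 3.
# Assumption (not stated in the text): coordinates are 1-based and inclusive.
C_TERMINAL_EXONS = {
    "C1": (1, 66),
    "C2": (1802, 1947),
    "C3": (4198, 4350),
    "C4": (4679, 4747),
    "C5": (6811, 6978),
    "C6": (11232, 11270),
    "C7": (11594, 11677),
    "C8": (14095, 14187),
}


def exon_length(start, end):
    """Length in bases of an exon given 1-based inclusive coordinates."""
    return end - start + 1


def intron_lengths(exons):
    """Lengths of the gaps between consecutive exons, in listed order."""
    spans = list(exons.values())
    return [nxt[0] - prev[1] - 1 for prev, nxt in zip(spans, spans[1:])]


exon_lengths = {name: exon_length(*span) for name, span in C_TERMINAL_EXONS.items()}
total_exonic = sum(exon_lengths.values())

if __name__ == "__main__":
    for name, length in exon_lengths.items():
        print(f"Exon {name}: {length} bases")
    print(f"Total exonic sequence: {total_exonic} bases")
    print(f"Intervening sequence lengths: {intron_lengths(C_TERMINAL_EXONS)}")
```

The same helper functions could be applied to the amino terminal and repeat exon lists, provided the same coordinate convention holds for those sequences.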

[0010] Now that the DNA sequence which encodes CA125 has been discovered, known
synthetic methods can be employed to prepare DNA molecules containing portions of the
sequence. Conventional cloning vehicles, such as plasmids, viruses, or bacteriophages, can be
modified using known methods so as to produce novel cloning vehicles which contain cDNA
encoding CA125, analogs or mutants thereof. Similarly, such cloning vehicles can be modified
or engineered so that they contain DNA molecules from Table 4 and SEQ ID NO: 4 or segments
substantially similar thereto. The DNA molecule inserted may be made by various methods
including enzymatic or chemical synthesis.

[0011] The CA125 gene has been cloned and multiple repeat sequences as well as the
glycosylated amino terminal and the carboxy terminus have been identified. CA125 requires a
transcript of more than 35,000 bases and occupies approximately 150,000 bp on chromosome
19p13.2. The CA125 molecule comprises three major domains: an extracellular amino terminal
domain (Domain 1); a large multiple repeat domain (Domain 2); and a carboxy terminal domain
(Domain 3) which includes a transmembrane anchor with a short cytoplasmic domain. The
amino terminal domain is assembled by combining five genomic exons, four very short amino
terminal sequences and one extraordinarily large exon. This domain is dominated by its capacity
for O-glycosylation and its resultant richness in serine and threonine residues. Additionally, an
amino terminal extension is present, which comprises four genomic exons. Analysis of the
amino terminal extension revealed that its amino acid composition is consistent with the amino
acid composition of the amino terminal domain.

[0012] Perhaps even more significantly, the multiple repeat domains of CA125 or other
domains could also be used for the development of a potential vaccine for patients with ovarian
cancer. In order to induce cellular and humoral immunity in humans to CA125, murine
antibodies specific for CA125 were utilized in anticipation of patient production of anti-idiotypic
antibodies, thus indirectly allowing the induction of an immune response to the CA125 molecule.
With the availability of recombinant CA125, especially domains which encompass epitope
binding sites for known murine antibodies, it will be feasible to more directly stimulate patients'
immune systems to CA125 and, as a result, extend the life of ovarian carcinoma patients.

[0013] The recombinant CA125 of the present invention may also be used to develop
therapeutic targets. Molecules like CA125, which are expressed on the surface of tumor cells,
provide potential targets for immune stimulation, drug delivery, biological modifier delivery or
any agent which can be specifically delivered to ultimately kill the tumor cells. Humanized or
human antibodies to CA125 epitopes could be used to deliver any drug or toxic agent, including
radioactive agents, to mediate direct killing of tumor cells. Natural ligands having a natural
binding affinity for domains on the CA125 molecule could also be utilized to deliver therapeutic
agents to tumor cells.

[0014] CA125 expression may further provide a survival or metastatic advantage to ovarian 
tumor cells. Antisense oligonucleotides derived from the CA125 repeat sequences could be used 
to down-regulate the expression of CA125. Further, antisense therapy could be used in 
association with a tumor cell delivery system of the type described above. 

[0015] Recombinant domains of the CA125 molecule also have the potential to identify 
small molecules, which bind to individual domains of the CA125 molecule. These small 
molecules could also be used as delivery agents or as biological modifiers. 

[0016] In another aspect of the present invention, an isolated nucleic acid of the CA125 gene 
is disclosed, which comprises a nucleotide sequence selected from the group consisting of: (a) 
the nucleotide sequences set forth in SEQ ID NOS: 1, 2, 3 and 4; (b) a nucleotide sequence
having at least 70% sequence identity to any one of the sequences in (a); (c) a degenerate variant 
of any one of (a) to (b); and (d) a fragment of any one of (a) to (c). 

[0017] In another aspect of the present invention, an isolated nucleic acid of the CA125 gene
is disclosed, comprising a sequence that encodes a polypeptide with an amino acid sequence
selected from the group consisting of: (a) the amino acid sequence set forth in SEQ ID NO: 5;
(b) an amino acid sequence having at least 50% sequence identity to the sequence in (a); (c) a
conservative variant of any one of (a) to (b); and (d) a fragment of any one of (a) to (c).

[0018] The vector may be a cloning vector, a shuttle vector, or an expression vector. A 
cultured cell comprising the vector is also contemplated. 

[0019] More specifically, this invention relates to a purified antibody that selectively binds to
an epitope in the CA125 protein of SEQ ID NO: 5. Similarly, the purified antibody selectively
binds to an amino acid sequence having at least 50% sequence identity to said sequence; the
purified antibody selectively binds to an amino acid sequence having at least 60% sequence
identity to said sequence; the purified antibody selectively binds to an amino acid sequence
having at least 70% sequence identity to said sequence; the purified antibody selectively binds to
an amino acid sequence having at least 80% sequence identity to said sequence; and the purified
antibody selectively binds to an amino acid sequence having at least 90% sequence identity to
said sequence. Additionally, the purified antibody can selectively bind to a conservative variant
of the amino acid sequence set forth in SEQ ID NO: 5 or a fragment thereof.

DETAILED DESCRIPTION OF THE INVENTION

[0020] In accordance with the present invention, conventional molecular biology,
microbiology, and recombinant DNA techniques may be used that will be apparent to those
skilled in the relevant art. Such techniques are explained fully in the literature (see, e.g.,
Maniatis, Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual" (1982); "DNA
Cloning: A Practical Approach," Volumes I and II (D. N. Glover ed. 1985); "Oligonucleotide
Synthesis" (M. J. Gait ed. 1984); "Nucleic Acid Hybridization" (B. D. Hames & S. J. Higgins
eds. (1985)); "Transcription and Translation" (B. D. Hames & S. J. Higgins eds. (1984));
"Animal Cell Culture" (R. I. Freshney, ed. (1986)); "Immobilized Cells And Enzymes" (IRL
Press, (1986)); and B. Perbal, "A Practical Guide To Molecular Cloning" (1984)).

[0021] Therefore, if appearing herein, the following terms shall have the definitions set out
below. 

[0022] A "vector" is a replicon, such as plasmid, phage or cosmid, to which another DNA 
segment may be attached so as to bring about the replication of the attached segment. 

[0023] A "DNA molecule" refers to the polymeric form of deoxyribonucleotides (adenine,
guanine, thymine, or cytosine) in either single-stranded form, or a double-stranded helix. This
term refers only to the primary and secondary structure of the molecule, and does not limit it to
any particular tertiary forms. Thus, this term includes double-stranded DNA found, inter alia, in
linear DNA molecules (e.g., restriction fragments), viruses, plasmids, and chromosomes.

[0024] As used herein, the term "gene" shall mean a region of DNA encoding a polypeptide
chain. 

[0025] "Messenger RNA" or "mRNA" shall mean an RNA molecule that encodes for one or 
more polypeptides. 

[0026] "DNA polymerase" shall mean an enzyme which catalyzes the polymerization of
deoxyribonucleotide triphosphates to make DNA chains using a DNA template. 

WO 2004/045553 PCT/US2003/037041

[0027] "Reverse transcriptase" shall mean an enzyme which catalyzes the polymerization of 
deoxy- or ribonucleotide triphosphates to make DNA or RNA chains using an RNA or DNA 
template. 

[0028] "Complementary DNA" or "cDNA" shall mean the DNA molecule synthesized by
polymerization of deoxyribonucleotides by an enzyme with reverse transcriptase activity. 

[0029] An "isolated nucleic acid" is a nucleic acid the structure of which is not identical to
that of any naturally occurring nucleic acid or to that of any fragment of a naturally occurring
genomic nucleic acid spanning more than three separate genes. The term therefore covers, for
example, (a) a DNA which has the sequence of part of a naturally occurring genomic DNA
molecule but is not flanked by both of the coding sequences that flank that part of the molecule
in the genome of the organism in which it naturally occurs; (b) a nucleic acid incorporated into a
vector or into the genomic DNA of a prokaryote or eukaryote in a manner such that the resulting
molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate
molecule such as a cDNA, a genomic fragment, a fragment produced by polymerase chain
reaction (PCR), or a restriction fragment; and (d) a recombinant nucleotide sequence that is part
of a hybrid gene, i.e., a gene encoding a fusion protein.

[0030] "Oligonucleotide", as used herein in referring to the probes or primers of the present
invention, is defined as a molecule comprised of two or more deoxy- or ribonucleotides,
preferably more than ten. Its exact size will depend upon many factors which, in turn, depend
upon the ultimate function and use of the oligonucleotide.

[0031] "DNA fragment" includes polynucleotides and/or oligonucleotides and refers to a
plurality of joined nucleotide units formed from naturally-occurring bases and cyclofuranosyl
groups joined by native phosphodiester bonds. This term effectively refers to naturally-occurring
species or synthetic species formed from naturally-occurring subunits. "DNA
fragment" also refers to purine and pyrimidine groups and moieties which function similarly but
which have non-naturally-occurring portions. Thus, DNA fragments may have altered sugar
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other
sulfur-containing species. They may also contain altered base units or other modifications,
provided that biological activity is retained. DNA fragments may also include species which
include at least some modified base forms. Thus, purines and pyrimidines other than those
normally found in nature may be so employed. Similarly, modifications on the cyclofuranose
portions of the nucleotide subunits may also occur as long as biological function is not
eliminated by such modifications.

[0032] "Primer" shall refer to an oligonucleotide, whether occurring naturally or produced
synthetically, which is capable of acting as a point of initiation of synthesis when placed under
conditions in which synthesis of a primer extension product, which is complementary to a
nucleic acid strand, is induced, i.e., in the presence of nucleotides and an inducing agent such as
a DNA polymerase and at a suitable temperature and pH. The primer may be either single-stranded
or double-stranded and must be sufficiently long to prime the synthesis of the desired
extension product in the presence of the inducing agent. The exact length of the primer will
depend upon many factors, including temperature, the source of primer and the method used.
For example, for diagnostic applications, depending on the complexity of the target sequence, the
oligonucleotide primer typically contains 10-25 or more nucleotides, although it may contain
fewer nucleotides.

[0033] The primers herein are selected to be "substantially" complementary to different
strands of a particular target DNA sequence. This means that the primers must be sufficiently
complementary to hybridize with their respective strands. Therefore, the primer sequence need
not reflect the exact sequence of the template. For example, a non-complementary nucleotide
fragment may be attached to the 5' end of the primer, with the remainder of the primer sequence
being complementary to the strand. Alternatively, non-complementary bases or longer
sequences can be interspersed into the primer, provided that the primer sequence has sufficient
complementarity with the sequence to hybridize therewith and thereby form the template for the
synthesis of the extension product.
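
The notion of a "substantially" complementary primer can be illustrated with a short Python sketch. It computes the reverse complement of a template site and counts mismatches in the 3' annealing portion of a primer while ignoring a non-complementary 5' tail. The template sequence, the primers and the function names are hypothetical and chosen for illustration only; the sketch does not model hybridization conditions such as temperature or salt concentration.

```python
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def three_prime_mismatches(primer, template_site):
    """Count mismatches between the primer's 3' portion and the template site.

    `template_site` is the template strand region (written 5'->3') to which the
    primer anneals. Extra bases at the primer's 5' end, for example an added
    restriction site, are treated as a non-complementary tail and ignored.
    """
    primer = primer.upper()
    perfect = reverse_complement(template_site)
    if len(primer) < len(perfect):
        raise ValueError("primer is shorter than the template site")
    annealing = primer[len(primer) - len(perfect):]
    return sum(1 for p, q in zip(annealing, perfect) if p != q)


# Hypothetical 20-base template site, for illustration only.
template_site = "TTGACCGTAAGCTGCATCGA"
perfect_primer = reverse_complement(template_site)

# Same primer carrying a non-complementary 5' tail (here a GGATCC motif).
tailed_primer = "GGATCC" + perfect_primer

# Same primer with a single interspersed non-complementary base.
swap = "A" if perfect_primer[5] != "A" else "C"
mutated_primer = perfect_primer[:5] + swap + perfect_primer[6:]

if __name__ == "__main__":
    for name, primer in [("perfect", perfect_primer),
                         ("5' tailed", tailed_primer),
                         ("one mismatch", mutated_primer)]:
        print(name, three_prime_mismatches(primer, template_site))
```

In this sketch the tailed primer scores zero mismatches in its annealing portion, while the primer with an interspersed base scores one; whether a given number of mismatches is tolerated in practice depends on the reaction conditions.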

[0034] As used herein, the term "hybridization" refers generally to a technique wherein
denatured RNA or DNA is combined with complementary nucleic acid sequence which is either
free in solution or bound to a solid phase. As recognized by one skilled in the art, complete
complementarity between the two nucleic acid sequences is not a pre-requisite for hybridization
to occur. The technique is ubiquitous in molecular genetics and its use centers around the
identification of particular DNA or RNA sequences within complex mixtures of nucleic acids.

[0035] As used herein, "restriction endonucleases" and "restriction enzymes" shall refer to 
bacterial enzymes which cut double-stranded DNA at or near a specific nucleotide sequence. 

[0036] "Purified polypeptide" refers to any peptide generated from CA125 either by
proteolytic cleavage or chemical cleavage.

[0037] "Degenerate variant" refers to any amino acid variation in the repeat sequence, which
fulfills the homology exon structure and conserved sequences and is recognized by the M11,
OC125 and ISOBM series of antibodies.

[0038] "Fragment" refers to any part of the CA125 molecule identified in a purification
scheme.

[0039] "Conservative variant antibody" shall mean any antibody that fulfills the criteria of
M11, OC125 or any of the ISOBM antibody series.

[0040] "Homology" refers to similarity based on identical base matches in alignment. When
two sequences are identical, there is 100% homology; as base matches differ in alignment, the
homology between the two sequences is reduced.
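
As a non-limiting sketch, homology in the sense defined above can be computed as the percentage of aligned positions that carry identical residues. The Python function below assumes that the two sequences have already been aligned to equal length and that a gap opposite a residue counts as a mismatch; both are assumptions made for illustration, and the function name and example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same residue.

    Both inputs must already be aligned to equal length. Columns in which both
    sequences have a gap are skipped; a gap opposite a residue counts as a
    mismatch. This is one common convention, assumed here for illustration.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical aligned sequences, for illustration only.
    print(percent_identity("ACGTACGTAC", "ACGTACGTAC"))  # identical sequences
    print(percent_identity("ACGTACGTAC", "ACGTTCGTAA"))  # two differing positions
    print(percent_identity("ACGT-CGT", "ACGTACGA"))      # one gap, one mismatch
```

Under this convention, two sequences differing at two of ten aligned positions have 80% homology and would satisfy a "70% homology" threshold.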

[0041] The CA125 gene has been cloned and multiple repeat sequences as well as the
carboxy terminus have been identified. The genomic DNA for the CA125 gene is set out in SEQ
ID NO: 4. The CA125 molecule comprises three major domains: an extracellular amino terminal
domain (Domain 1); a large multiple repeat domain (Domain 2); and a carboxy terminal domain
(Domain 3) which includes a transmembrane anchor with a short cytoplasmic domain. The
amino terminal domain is assembled by combining five genomic exons, four very short amino
terminal sequences and one extraordinarily large exon. This domain is dominated by its capacity
for O-glycosylation and its resultant richness in serine and threonine residues. Additionally, an
amino terminal extension is present, which comprises four genomic exons. The amino acid
composition of the amino terminal extension was found to be consistent with the amino acid
composition of the amino terminal domain. The molecular structure is dominated by a repeat
domain comprising 156 amino acid repeat units, which encompass the epitope binding sites.
More than 60 repeat units have been identified, sequenced, and contiguously placed in the
CA125 domain structure. The repeat units encompass an interactive disulfide bridged
C-enclosure and the site of OC125 and M11 binding. The repeat sequences demonstrated 70-85%
homology to each other. Expression of the repeats was demonstrated in E. coli. The CA125
molecule is anchored at its carboxy terminal through a transmembrane domain and a short
cytoplasmic tail. The carboxy terminal also contains a proteolytic cleavage site approximately
50 amino acids upstream from the transmembrane domain, which allows for proteolytic cleavage
and release of the CA125 molecule. Any one of the repeat domains has the potential for use as a
new gold standard for detecting and monitoring the presence of the CA125 antigen. Further, the
repeat domains or other domains, especially the C-terminal to the repeat domain, also provide a
basis for the development of a vaccine, which would be useful for the treatment of ovarian
cancer and other carcinomas where CA125 is elevated.
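
The scale of the repeat domain can be estimated from the figures given above. The following Python sketch treats "more than 60" repeat units as a lower bound of 60 and assumes three bases per encoded amino acid; the variable names are hypothetical and the result is an order-of-magnitude estimate only.

```python
REPEAT_UNIT_RESIDUES = 156  # amino acids per repeat unit, as stated in the text
MIN_REPEAT_UNITS = 60       # "more than 60" repeat units, used as a lower bound
BASES_PER_CODON = 3         # standard genetic code

# Lower bounds on the size of the repeat domain and its coding sequence.
min_repeat_residues = REPEAT_UNIT_RESIDUES * MIN_REPEAT_UNITS
min_repeat_coding_bases = min_repeat_residues * BASES_PER_CODON

if __name__ == "__main__":
    print(f"Repeat domain: at least {min_repeat_residues} amino acids")
    print(f"Coding sequence: at least {min_repeat_coding_bases} bases")
```

On these assumptions the repeat domain alone accounts for at least 9,360 amino acids and 28,080 coding bases, which is in keeping with a transcript of more than 35,000 bases.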

[0042] The DNA sequences of the present invention can also be characterized as encoding
the amino acid sequence equivalents of the amino acid sequence. Equivalents, as used in this
context, include peptides of substantially similar length and amino acid identity to those
disclosed, but having a conservative amino acid substitution at a non-critical residue position. A
conservative amino acid substitution is a substitution in which an amino acid residue is replaced
with an amino acid residue of differing identity, but whose R group can be characterized as
chemically similar. Four common categories include: polar but uncharged R groups; positively
charged R groups; negatively charged R groups; and hydrophobic R groups. A preferred
conservative substitution involves the substitution of a second hydrophobic residue for a first
hydrophobic residue, the first and second hydrophobic residues differing primarily in the size of
the R group. The hydrophobic residue would be predicted to be located internally in the folded
peptide structure, and the mild perturbation caused only by a change in the size of an R group at
an internally located residue would not alter the antigenicity of the protein. More specifically, two
nucleic acid molecules are substantially equivalent if they have at least about 70% homology.
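
The four R group categories named above lend themselves to a simple lookup. The Python sketch below classifies a substitution as conservative when both residues fall in the same category. The assignment of individual amino acids to categories is an assumption based on one common textbook grouping (for instance, glycine, proline, cysteine, tyrosine and histidine are placed differently by different authors), and the names used are hypothetical.

```python
# One-letter amino acid codes grouped by R group category.
# Assumption: a simplified textbook grouping; other groupings exist.
CATEGORIES = {
    "polar_uncharged": "STNQCY",
    "positive": "KRH",
    "negative": "DE",
    "hydrophobic": "AVLIMFWPG",
}

R_GROUP_CLASS = {
    residue: category
    for category, residues in CATEGORIES.items()
    for residue in residues
}


def is_conservative(original, replacement):
    """True if the residues differ but share the same R group category."""
    o, r = original.upper(), replacement.upper()
    return o != r and R_GROUP_CLASS[o] == R_GROUP_CLASS[r]


if __name__ == "__main__":
    print(is_conservative("L", "I"))  # hydrophobic for hydrophobic
    print(is_conservative("K", "R"))  # positively charged for positively charged
    print(is_conservative("L", "D"))  # hydrophobic for negatively charged
```

A leucine-to-isoleucine change, which differs primarily in the shape and size of a hydrophobic R group, is the kind of substitution described above as preferred.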

[0043] The isolated cDNA sequences (Table 4 and SEQ ID NO: 4) of the present invention
can be inserted into an expression vector. Such vectors contain all necessary regulatory signals
to promote the expression of a DNA sequence of interest. Expression vectors are typically either
prokaryote or eukaryote specific. Expression vectors can be introduced into either prokaryote or
eukaryote cells to produce CA125 proteins or portions thereof. The isolated cDNA sequence as
shown in Table 4 was expressed to provide the CA125 molecule set out in Table 5 and SEQ ID
NO: 5. The expressed CA125 is a polypeptide with the amino acid sequence set forth in SEQ ID
NO: 5; an amino acid sequence having at least 50% sequence identity to the sequence; a
conservative variant; or a fragment of any of the above. Two polypeptide sequences are
substantially equivalent if there is at least 50% sequence homology and substantially similar
physical characteristics. However, in practice, a portion of an isolated nucleic acid molecule set
out in SEQ ID NO: 4 is expressed to obtain a fragment of the CA125 molecule. This fragment is
then purified to obtain an isolated CA125 fragment.

[0044] In certain embodiments, "purified" refers to a polypeptide composition which has
been subjected to fractionation to remove various nonprotein components such as other cellular
components. Various techniques suitable for use in protein purification are known to those
skilled in the relevant art. These techniques include, for example, precipitation with ammonium
sulphate, PEG, antibodies and the like or by heat denaturation, followed by centrifugation;
chromatography steps such as ion exchange, gel filtration, reverse phase, hydroxylapatite and
affinity chromatography; isoelectric focusing; gel electrophoresis; and combinations of such
techniques. Similarly, a "purification scheme" is a technique or system, drawing on the same
techniques, to remove various nonprotein components such as other cellular components from
the expressed protein.

[0045] The genomic DNA and a full-length cDNA sequence of human CA125 have been
determined. Additionally, a nucleic acid molecule encoding a 5' upstream region of the CA125
gene has been determined. cDNA is expressed with the use of an expression vector. An
expression vector is a carrying vector that has an inducer for expression built into the vector.
Different vectors use different inducers. The cDNA is ligated into the expression vector using
restriction digest sites designed in the vector. The cDNA must be ligated in the sense direction
and in the correct reading frame for expression to occur. Once the cDNA is ligated into the
expression vector, the construct is transformed into a cell. In the preferred embodiment, we use
E. coli bacteria, but the transformation can be done with yeast, mammalian cells, plant cells,
etc. The transformed cells are then grown in culture and protein production is induced with
an inducing agent for the expression vector. In the preferred embodiment, we use the pQE-30
expression vector and induce with IPTG. Once induction has occurred, the cells are harvested
and the protein is purified. It should be noted that some expression vectors add tags to the
recombinant protein to aid in purification. For example, pQE-30 adds a His-Tag which binds to
nickel to aid in purification. Once cells have been successfully transformed, a small aliquot can
be frozen and stored for future use.
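
The requirement that the cDNA be ligated in the correct reading frame can be checked computationally before cloning. The Python sketch below fuses a vector-derived coding prefix to an insert, tests whether the insert begins on a codon boundary, and reports the first stop codon encountered. The prefix shown (a start codon followed by six histidine codons) is a hypothetical stand-in for a tag-encoding region and is not the actual pQE-30 sequence; the insert and all function names are likewise illustrative assumptions.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons(seq):
    """Split a sequence into complete codons, reading from the first base."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]


def check_fusion(vector_prefix, insert):
    """Check whether `insert` continues the reading frame opened in the prefix.

    Assumes the insert's own coding frame starts at its first base. Returns
    (in_frame, first_stop_index), where the index counts codons of the fused
    sequence from zero and is None if no stop codon is found.
    """
    in_frame = len(vector_prefix) % 3 == 0
    fused = codons(vector_prefix + insert)
    first_stop = next((i for i, c in enumerate(fused) if c in STOP_CODONS), None)
    return in_frame, first_stop


# Hypothetical tag-encoding prefix: ATG plus six histidine (CAT) codons.
vector_prefix = "ATG" + "CAT" * 6
# Hypothetical insert: cloning-site bases, a short coding stretch, a stop codon.
insert = "GGATCC" + "GCTGAAAAG" + "TAA"

if __name__ == "__main__":
    print(check_fusion(vector_prefix, insert))        # frame kept, stop at the end
    print(check_fusion(vector_prefix + "G", insert))  # one extra base shifts the frame
```

In this toy case a single extra base at the junction shifts the frame and introduces a premature stop codon, which is the failure that ligation in the correct reading frame avoids.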

[0046] With a cDNA sequence, one skilled in the art has an easily renewable source of
purified CA125. Portions of this cDNA sequence can be expressed to make CA125 polypeptides
and these polypeptides can be used to make monoclonal antibodies. These monoclonal
antibodies can be made by one skilled in the art to portions of the protein for which no
monoclonal antibodies have heretofore been available, such as the amino terminal sequence.

[0047] More specifically, the purified antibodies are made by the following process: the
recombinant protein is injected into an animal (usually a mouse, but other animals can be used).
The animal's B-lymphocytes produce antibodies to the protein. Each activated B lymphocyte
forms a clone of cells in spleen or lymph nodes, with each cell of the clone producing identical
antibody. These spleen cells are then harvested and fused with myeloma cells to produce
hybridomas. These hybridomas are immortal and produce only one type of antibody. The
hybridomas are selected from cells that did not fuse by selective media. The hybridomas can then
be grown in large quantities to produce large quantities of monoclonal antibodies.

[0048] Therapeutic Targets: Molecules that are expressed on the surface of tumor cells, as
CA125 is, offer potential targets for immune stimulation, drug delivery, biological modifier
delivery or any agent which can be specifically delivered to ultimately kill the tumor cells.
CA125 offers such potential as a target: 1) Antibodies to CA125 epitopes or newly described
potential epitopes: Most especially humanized or human antibodies to CA125, which could
directly activate the patients' immune system to attack and kill tumor cells. Antibodies could be
used to deliver any drug or toxic agent, including radioactive agents, to mediate direct killing of
tumor cells. 2) Natural ligands: Under normal circumstances, molecules are bound to the CA125
molecule, e.g., a 50 kilodalton protein which does not contain CA125 epitopes co-purifies with
CA125. Such a molecule, which might have a natural binding affinity for domains on the
CA125 molecule, could also be utilized to deliver therapeutic agents to tumor cells.

[0049] Antisense therapy: CA125 expression may provide a survival or metastatic 
advantage to ovarian tumor cells; as such, an antisense oligonucleotide derived from the CA125 
sequence could be used to down-regulate the expression of CA125. Antisense therapy could be 
used in association with a tumor cell delivery system such as described above. 
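
By way of illustration only, the derivation of an antisense oligonucleotide from a known sense-strand sequence can be sketched as a reverse-complement operation. The function name `antisense_oligo` and the 20-base target shown are assumptions chosen for the example (the target is simply the first 20 bases following the restriction site in primer 1 of Example 1); nothing here represents an oligonucleotide that was actually designed or tested.

```python
# Illustrative sketch only: names and the example target are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(sense_target: str) -> str:
    """Return the reverse complement (written 5'->3') of a sense-strand DNA target."""
    target = sense_target.upper()
    if set(target) - set("ACGT"):
        raise ValueError("target must contain only A, C, G and T")
    # Complement each base, then reverse so the result reads 5'->3'.
    return target.translate(COMPLEMENT)[::-1]


if __name__ == "__main__":
    # Hypothetical 20-base target taken from the sense primer of Example 1.
    print(antisense_oligo("ATGGGCCACACAGAGCCTGG"))
```

A practical design would also have to weigh target accessibility, oligonucleotide length and chemistry, none of which this sketch addresses.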

[0050] Small Molecules: Recombinant domains of CA125 also offer the potential to identify 
small molecules which bind to individual domains of the molecule. Small molecules, either from 
combinatorial chemical libraries or small peptides, can also be used as delivery agents or as 
biological modifiers. 

[0051] Transgenic Animals/Transformed: CA125 and genomic DNA can be used to develop 
transgenic animal models and can be used, under low stringency conditions, to clone CA125 
cDNAs and genomic DNAs of other animal species. The CA125 cDNA can be used to prepare 
stable transformants. The bacterial cells could be transformed with CA125 cDNA to include 
these genes. 
Example 1 



[0052] Expression of 6xHis-tagged CA125 repeat in E. coli: The open reading frame of the 
CA125 repeat was amplified by PCR with the 5' sense primer 1 
(5'-ACCGGATCCATGGGCCACACAGAGCCTGGCCC-3') (SEQ ID NO: 6) and the 3' antisense 
primer 2 (5'-TGTAAGCTTAGGCAGGGAGGATGGAGTCC-3') (SEQ ID NO: 7). The open 
reading frame of the CA125 repeat constitutes a portion of the isolated nucleic acid molecule set 
out in SEQ ID NO: 4. PCR was performed in a reaction mixture consisting of ovarian tumor 
cDNA derived from 50 ng of mRNA, 5 pmol each of sense and antisense primers for the CA125 
repeat, 0.2 mmol of dNTPs, and 0.625 U of Taq polymerase in 1x buffer in a final volume of 
25 µl. This mixture was subjected to 1 minute of denaturation at 95°C followed by 30 cycles of 
PCR consisting of the following: denaturation for 30 seconds at 95°C, 30 seconds of annealing at 
[temperature illegible in the source], and 1 minute of extension at 72°C, with an additional 
7 minutes of extension on the last cycle. The product was electrophoresed through a 2% agarose 
gel for separation, and the PCR product was purified and digested with the restriction enzymes 
BamHI and HindIII. This digested PCR product was ligated into the expression vector pQE-30, 
which had also been digested with BamHI and HindIII. The ligation reaction consisted of 1 µl of 
10x ligation buffer, 1 µl of 10x bovine serum albumin (BSA), 1 µl of T4 ligase enzyme, and 7 µl 
of digested PCR product. The reaction went overnight at [temperature illegible in the source]. 
This clone would allow for expression of recombinant amino-terminal 6xHis-tagged CA125 
repeat. The construct was then transformed into JM109 E. coli cells. The transformation reaction 
consisted of 5 µl of ligation reaction and 50 µl of cells. The reaction was mixed gently and then 
incubated on ice for 30 minutes. The reaction was then heat shocked at 42°C for 45 seconds in a 
water bath before being returned to ice for 2 minutes. 500 µl of LB broth media were added and 
the mixture was incubated at 37°C for 1 hour. At the end of 1 hour the cells were spun down in a 
microfuge at 4,000 x g for 5 minutes and approximately 450 µl of media was removed. The cells 
were resuspended in the remaining 100 µl of media, transferred to a LAIX plate, and incubated 
overnight at 37°C. White colonies were picked from the plate and cultured in 5 ml LB broth 
media to determine if the transformation was successful. Transformed E. coli were grown in 1 L 
cultures to an OD600 of 1.5-2.0 at 37°C and were induced with IPTG (0.1 mM) for 4-6 hours at 
[temperature illegible in the source] to produce recombinant protein. Whole E. coli lysate was 
electrophoresed through a denaturing 12% polyacrylamide gel and Coomassie stained to detect 
highly expressed proteins. His-tagged proteins were purified under denaturing conditions using 
Ni-NTA agarose metal chelating affinity chromatography available from Qiagen according to the 
manufacturer's instructions. Cells are spun down to remove liquid LB broth media. The cells are 
then resuspended in 40 ml of 8 M urea lysis buffer (pH 8.0) and incubated with agitation 
overnight at room temperature. The mixture is then spun down and the lysate is removed. The 
lysate is then incubated with Ni-NTA agarose beads with agitation overnight at room 
temperature. The beads are pelleted and the supernatant is removed. The beads are then washed 
twice in lysis buffer pH 8.0 plus Triton X, three times with lysis buffer pH 6.3 plus Triton X, and 
four times with lysis buffer pH 6.3 without Triton X. The protein is then eluted from the beads 
with lysis buffer pH 4.2 plus 25 mM EDTA, incubated overnight with agitation at room 
temperature. The beads are pelleted and the supernatant containing the recombinant protein is 
removed. The supernatant is then dialyzed twice in 0.2x PBS to remove the urea and freeze dried 
for storage. 1 L of culture produced 2.4 mg of recombinant protein. 
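
As a hedged cross-check of the figures given in Example 1, the following sketch locates the BamHI and HindIII recognition sequences in the two primers and totals the stated cycling times and reaction volumes. The primer sequences, times and volumes are those of Example 1; the helper names (`site_offset`, `cycling_minutes`, `volume_left_ul`) are hypothetical, and the cycling estimate ignores thermal ramp times.

```python
# Values are transcribed from Example 1; all function names are illustrative.
BAMHI_SITE = "GGATCC"    # BamHI recognition sequence
HINDIII_SITE = "AAGCTT"  # HindIII recognition sequence
PRIMER_1 = "ACCGGATCCATGGGCCACACAGAGCCTGGCCC"  # 5' sense primer (SEQ ID NO: 6)
PRIMER_2 = "TGTAAGCTTAGGCAGGGAGGATGGAGTCC"     # 3' antisense primer (SEQ ID NO: 7)


def site_offset(primer: str, site: str) -> int:
    """Return the 0-based offset of a recognition site in a primer, or -1."""
    return primer.upper().find(site)


def cycling_minutes(initial_s: int, per_cycle_s: int, cycles: int, final_extra_s: int) -> float:
    """Total programmed hold time in minutes (ramp times are not modelled)."""
    return (initial_s + per_cycle_s * cycles + final_extra_s) / 60


def volume_left_ul(added_ul: list, removed_ul: float) -> float:
    """Volume remaining after pooling the added volumes and removing some."""
    return sum(added_ul) - removed_ul


if __name__ == "__main__":
    print("BamHI site offset in primer 1:", site_offset(PRIMER_1, BAMHI_SITE))
    print("HindIII site offset in primer 2:", site_offset(PRIMER_2, HINDIII_SITE))
    # 1 min initial denaturation; 30 cycles of 30 s + 30 s + 60 s; 7 min extra extension.
    print("Programmed PCR time (min):", cycling_minutes(60, 30 + 30 + 60, 30, 7 * 60))
    # Ligation: 1 + 1 + 1 + 7 microliters.
    print("Ligation volume (ul):", volume_left_ul([1, 1, 1, 7], 0))
    # Transformation: 5 + 50 + 500 microliters, about 450 removed after the spin.
    print("Volume left for plating (ul):", volume_left_ul([5, 50, 500], 450))
```

The plating volume computed this way is about 105 µl, consistent with the approximately 100 µl stated in the example.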

[0053] It should be understood that various changes and modifications to the presently 
preferred embodiments described herein will be apparent to those skilled in the art. Such 
changes and modifications can be made without departing from the spirit and scope of the 
present invention and without diminishing its attendant advantages. 
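
Table 1 presents the genomic sequence as numbered rows, each row giving a starting position followed by five blocks of ten bases. A minimal sketch of how such rows might be read back into contiguous sequence, and how rows of unexpected length might be flagged, is given below. The function names `parse_rows` and `short_or_long_rows` are assumptions made for illustration, and the two sample rows are those shown at positions 2001 and 2051 of Table 1.

```python
import re

# A row is a starting position followed by whitespace-separated base blocks.
ROW = re.compile(r"^\s*(\d+)\s+([A-Za-z ]+?)\s*$")


def parse_rows(text: str) -> dict:
    """Map each row's starting position to its bases with spaces removed."""
    rows = {}
    for line in text.splitlines():
        match = ROW.match(line)
        if match:
            rows[int(match.group(1))] = match.group(2).replace(" ", "")
    return rows


def short_or_long_rows(rows: dict, width: int = 50) -> list:
    """Return starting positions of rows whose length differs from the expected width."""
    return [pos for pos, seq in sorted(rows.items()) if len(seq) != width]


if __name__ == "__main__":
    sample = (
        "2001 AAGCGTTGCA CAATTCCCCC AACCTCCATA CATACGGCAG CTCTTCTAGA\n"
        "2051 CACAGGTTTT CCCAGGTCAA ATGCGGGGAC CCCAGCCATA TCTCCCACCC\n"
    )
    rows = parse_rows(sample)
    print(sorted(rows), short_or_long_rows(rows))
```

Rows flagged by such a check would need to be compared against the original sequence listing rather than corrected automatically.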
Table 1 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

1 GGTGCGCACC ACTATGTCTG GCTAATTTTT GTATTTTTTT GTAGAGACAT 
51 GGTTTCACCA TGTTGGCCAG GCTGGTCTCG AATTCCTGAC TTCAAGTAAT 
101 CCACCCACCT CAGCCTCCCA AAGTGCTG6G ATTACAAGCA TGAGCCACCA 
151 TGCATGGCCT AAAGCTTCTT TTAAAGCCAC CAAGTCCCTT [missing] 
201 CCCACTAATC CATGGGTTAG TCATGAATGG ATTAATCTAT [missing] 
251 AGAGCCCTCA TCACCCAATC ACCTCTTAAA GGCCCCACCT CTCAATACTG 
301 CCACACTGGG GATTAAGTTT CAACAGAGTT TTGGAGGGGA CATTCAAATC 
351 ATAGTAATGC CCAAAGTGAA AAATCTTCCC TGCACTTTTC CCTCAACAAA 
401 AACAGCCAGA GATAGTGAGC TGCCAGGAAA [missing] TTTCCTCTTC 
451 TGTCCTAAAT CAGCATCGCT AGACCTTTAC ATGATTCAAC CTCATCTTCT 
501 TCACCCTCTG GGTCATGAAA TTTTATTTAT TTATTTATTA TTTTCTTGGG 
551 ACAGACTCTG GCTCTGTCGC CCAGGCTGAA GTGCAGTGGT GTGATCTTGG 
601 CTCACTGCAA CCTCCGCCTC CCGGGTTCAA GCGATTCTCC TGCCTCAGCC 
651 TCCTGAGTAG CTGGGATTAC AGGTGGGCGC CACCACACCC A6CTAATTTT 
701 TTGTATTTTT AGTAGAGATG GGGTTTCACC ATATTAGCCA GGATGGTCTC 
751 CATCTCTTGA CCTCGTGATC TGCCCACCTC AGCCTCCCAA AATGCTGGGA 
801 TTACAGGCAT GAGACACCAC GCCCAGCAGG CCAGGGTCAT GAGATTTTAA 
851 TCAAGAGCAA CTTCCACTGA TTCCTGAGAG TGCATCTGTG GGCCCCTGCT 
901 CTGATCTGAA CAGAAGTGCC GTGTCTTCTC TGACCTCCAC TTCTCAATTC 
951 AAGAGCCTTA GTATCTGCCA GTATCACACA CTGAGCATTA GCTCCATCTC 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

1001 ATGGGGGTGT AGGTAGGGGC TCTATCTGCA TCTTTCTTTC TTTTTTTCTT 
1051 TCTTTCCCTT CCTCCCTTCC TCACTCCCTC GGTCCTCTCT TTCTTTCCTT 
1101 TTCTTTCTTC CTTCCTCCCT TCCTCCCTCC CTCCCTCTCT CTTTCTCTCT 
1151 TTCTTTCTTT CCTTCTTTCT TTCTTTCTCT CTTCCTTCCC TCCCTCCCTC 
1201 CTTCCTTCCT TTCTCTTTCT TTCTCTTTCT TTCTTTTTTT CCTTCCTTCC 
1251 TTCCTTCTTT CTCTTTCTCT CCCTCCCTTC CTTCCTTCCT TCCTTCCTTC 
1301 CTTCCTTTCT TTCTTTCTTT CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT 
1351 TTCTTTCTTC CTTCCTTCCT TCCTTCCTTC CTTCCTTCCT TCCTTCCTTT 
1401 CTTTTCTTTC TTTCTCTTTC TTTTTQAGAC AGAGCTCTTA TTACCCATGC 
1451 TGGAGTGCAG TGGTGTGACC TTGGCTTACT GCAACATCTG CCTCCTAGGG 
1501 TCAAGTGATT CTCCTGCCTC AGCCTCCTAA GTAGGTGGGA TTACAGACAC 
1551 ATGCCACCAC ACCCAATATT TATTTTTATT AAJWVTTTTTT TTAAAATTAT 
1601 TTTTAAAAAA TTAAAAATAA TTTTGTATTT TTAGTAGAGA CGGGGTTTCT 
1651 CCATGTTGGT CAGGCTGATC TCAAACTCCC AACCTCAGGT GATCCTCCCA 
1701 CCTCACCTCC CAAAGTGCTG GGATTACAGG CATGAGCCAC C6T6CCCAGC 
1751 CTGGTTCCTG GTTTCTAAGA CATCACACAC ACACACACAC ACACACACAC 
1801 ACACTCACAC ACTCAGAGAG AGAGAGAGAG AGAGGATCAT TAAGACATGA 
1851 TACACTAAGA AATTCTATTC TGCAGACACT GAGAATCCCT TAAAAAGTTT 
1901 GAAGGGAAGA ATTGAGATCA TCAGGTGTTT ATTTGAGGAA ATTGTCTGTG 
1951 GTTGAACTAT CCTTTCCTTT CTCTCCCTGA GATTTGGTCT TCTCAATTAG 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



2001 AAGCGTTGCA CAATTCCCCC AACCTCCATA CATACGGCAG CTCTTCTAGA 

2051 CACAGGTTTT CCCAGGTCAA ATGCGGGGAC CCCAGCCATA TCTCCCACCC 

2101 TGAGAAATTT TGGAGTTTCA GGGAGCTCAG AAGCTCTGCA GAGGCCACCC 

2151 TCTCTGAGGG GATTCTTCTT AGACCTCCAT CCAGAGGCAA ATGTTGACCT 



2201 GTC CATOCTG AAACCCTCAG GCCTTCCTGG GTCATCT TCT CCCACCCGCT 

2251 CCTTGATGAC AGGGAGCAGG AGCACTAAAG CCACACCAGA AATGGATTCA 

2301 GGACTGACAG GAGCCACCTT GTCACCTAAG ACATCTACAG GTGCAATCGT 

2351 GGT6ACAGAA CATACTCTGC CCTTTACTTC CCCAGATAAG ACCTTGGCCA 

2401 GTCCTACATC TTCGGTTGTG GGAAGAACCA CCCaflTCTTT GGGGGTGATG 

2451 TCCTCTGCTC TCCCTGAGTC AACCTCTAGA GGAATGACAC ACTCCGAGCA 

2501 AAGAACCAGC CCATCGCTGA GTCCCCAGGT CAATCGfACT CCCTCTAGGA 

2551 ACTACCCTGC TACAAGCATG GTTTCAQGAT TGAGTTCCCC AAGGACCAGG 

2601 ACCAGTTCCA CAGAAGGAAA TTTTACCAAA GAAGCA TCTA CATACACACT 

2651 CACTGTAQAG ACCACAAGTG GCCCAGTCAC TGAGAAG TAC ACAGTCCCCA 

2701 CTGAGACCTC AACAACTGAA GGTGACAGCA CAGAGAC CCC CTGGGACACA 

2751 AGATATATTC CTGTAAAAAT CACATCTCCA ATGAAAACAT TTGCAGATTC 

2801 AACTGCATCC AAGGAAAATG CCCCAGTGTC TATGACTCCA GCTGAGACCA 

2851 CAGTTACTGA CTCACATACT CCAGGRAGGA CAAACCCAT C ATTTGGGACA 

2901 CTTTATTCTT CCTTCCTTGA CCTATCACCT AAAGGGACCC CAAATTCCAG 



Exon 1 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

2951 AGGTGAAACA AGCCTGGAAC TGATTCTATC AACCACTGGA TATCCCTTCT 
3001 CCTCTCCTGA ACCTGGCTCT GCAGGACACA GCAGAATAAG TACCAGTGCG 
3051 CCTTTGTCAT CATCTGCTTC AGTTCTCGAT AATAAAATAT CAGAGACCAG 
3101 CATATTCTCA GGCCAGAGTC TCACCTCCCC TCTGTCTCCT GGGGTGCCCG 
3151 AGGCCAGAGC CAGCACAATG CCCAACTCAG CTATCCCTTT TTCCATGACA 
3201 CTAAGCAATG CAGAAACAAG TGCCGAAAGG GTCAGAAGCA CAATTTCCTC 
3251 TCTGGGGACT CCATCAATAT CCACAAAGCA GACAGCAGAG ACTATCCTTA 
3301 CCTTCCATGC CTTCGCTGAG ACCATGGATA TACCCAGCAC CCACATAGCC 
3351 AAGACTTTGG CTTCAGAATG GTTGGGAAGT CCAGGTACCC TTGGTGGCAC 
3401 CAGCACTTCA GCGCTGACAA CCACATCTCC ATCTACCACT TTAGTCTCAG 
3451 AGGAGACCAA CACCCATCAC TCCACGAGTG GAAAGGAAAC AGAAGGAACT 
3501 TTGAATACAT CTATGACTCC ACTTGAGACC TCTGCTCCTG GAGAAGAGTC 
3551 CGAAATGACT GCCACCTTGG TCCCCACTCT AGGTTTTACA ACTCTTGACA 
3601 GCAAGATCAG AAGTCCATCT CAGGTCTCTT CATCCCACCC AACAAGAQAG 
3651 CTCAGAACCA CAGGCAGCAC CTCTGGGAGG CAGAGTTCCA GCACAGCTGC 
3701 CCACGGGAGC TCTGACATCC TGAGGGCAAC CACTTCCAGC ACCTCAAAAG 
3751 CATCATCATG GACCAGTGAA AGCACAGCTC AGCAATTTAG TGAACCCCAG 
3801 CACACACAGT GGGTGGAGAC AAGTCCTAGC ATGAAAACAG AGAGACCCCC 
3851 AGCATCAACC AGTGTGGCAG CCCCTATCAC CACTTCTGTT CCCTCAGTGG 
3901 TCTCTGGCTT CACCACCCTG AAGACCAGCT CCACAAAAGG GATTTGGCTT 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

3951 GAAGAAACAT CTGCAGACAC ACTCATCGGA GAATCCACAG CTGGCCCAAC 
4001 CACCCATCAG TTTGCTGTTC CCACTGGGAT TTCAATGACA GGAGGCAGCA 
4051 GCACCAGGGG AAGCCAGGGC ACAACCCACC TACTCACCAG AGCCACAGCA 
4101 TCATCTGAGA CATCCGCAGA TTTGACTCTG GCCACGAACG GTGTCCCAGT 
4151 CTCCGTGTCT CCAGCAGTGA GCAAGACGGC TGCTGGCTCA AGTCCTCCAG 
4201 GAGGGACAAA GCCATCATAT ACAATGGTTT CTTCTGTCAT CCCTGAGACA 
4251 TCATCTCTAC AGTCCTCAGC TTTCAGGGAA GGAACCAGCC TGGGACTGAC 
4301 TCCATTAAAC ACTAGACATC CCTTCTCTTC CCCTGAACCA GACTCTGCAG 
4351 GACACACCAA GATAAGCACC AGCATTCCTC TGTTGTCATC TGCTTCAGTT 
4401 CTTGAGGATA AAGTGTCAGC GACCAGCACA TTCTCACACC ACAAAGCCAC 
4451 CTCATCTATT ACCACAGGGA CTCCTGAAAT CTCAACAAAG ACAAAGCCCA 
4501 GCTCAGCCGT TCTTTCCTCC ATGACCCTAA GCAATGCAGC AACAAGTCCT 
4551 GAAAGAGTCA GAAATGCAAC TTCCCCTCTG ACTCATCCAT CTCCATCAGG 
4601 GGAAGAGACA GCAGGGAGTG TCCTCACTCT CAGCACCTCT GCTGA6ACTA 
4651 CAGACTCACC TAACATCCAC CCAACTGGGA CACTGACTTC AGAATCGTCA 
4701 GAGAGTCCTA GCACTCTCAG CCTCCCAAGT GTCTCTGGAG TCAAAACCAC 
4751 ATTTTCTTCA TCTACTCCTT CCACTCATCT ATTTACTAGT GGAGAAGAAA 
4801 CAGAGGAAAC TTCGAATCCA TCTGTGTCTC AACCTGAGAC TTCTGTTTCC 
4851 AGAGTAAGGA CCACCTTGGC CAGCACCTCT GTCCCTACCC CAGTATTCCC 
4901 CACCATGGAC ACCTGGCCTA CACGTTCAGC TCAGTTCTCT TCATCCCACC 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

4951 TAGTGAGTGA GCTCAGAGCT ACGAGCAGTA CCTCAGTTAC AAACTCAACT 
5001 GGTTCAGCTC TTCCTAAAAT ATCTCACCTC ACTGGGACGG CAACAATaTP 
5051 ACAGACCAAT AGAGACACGT TTAATGACTC TGCTGCACCC CAAAGCACAA 
5101 CTTGGCCAGA GACTAGTCCC AGATTCAAGA CAGGGTTACC TTCAGCAACA 
5151 ACCACTGTTT CAACCTCTGC CACTTCTCTC TCTGCTACTG TAATGGTCTC 
5201 TAAATTCACT TCTCCAGCAA CTAGTTCCAT GGAAGCAACT TCTATCAGGG 
5251 AACCATCAAC AACCATCCTC ACAACAGAGA CCACGAATGG CCCAGGCTCT 
5301 ATGGCTGTGG CTTCTACCAA CATCCCAATT GGAAAGGGCT ACATTArrrtA 
5351 AGGAAGATTG GACACAAGCC ATCTGCCCAT TGGAACCACA GCTTCCTCTG 
5401 AGACATCTAT GGATTTTACC ATGGCCAAAG AAAGTGTCTC AATGTCAGTA 
5451 TCTCCATCTC AGTCCATGGA TGCTGCTGGC TCAAGCACTC CAGGAAGGAC 
5501 AAGCCAATTC GTTGACACAT TTTCTGATGA TGTCTATCAT TTAACATCCA 
5551 GAGAAATTAC AATACCTAGA GATGGAACAA GCTCAGCTOT GACTCCACAA 
5601 ATGACTGCAA CTCACCCTCC ATCTCCTGAT CCTGGCTCTG CTAGAAGCAC 
5651 CTGGCTTGGC ATCTTGTCCT CATCTCCTTC TTCTCCTACT CCCAAAGTCA 
5701 CAATGAGCTC CACATTTTCA ACTCAGAGAG TCACCACAAG CATGATAATG 
5751 GACACAGTTG AAACTAGTCG GTGGAACATG CCCAACTTAC CTTCCACGAC 
5801 TTCCTTGACA CCAAGTAATA TTCCAACAAG TGGTGCCATA OGAAaaartra 
5851 CCCTGGTTCC CTTGGACACT CCATCTCCAG CCACATCATT GGAGGCATCA 
5901 GAAGGGGGAC TTCCAACCCT CAGCACCTAC CCTGAATCAA CAAACACACC 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

5951 CAGCATCCAC CTCGGAGCAC ACGCTAGTTC AGAAAGTCCA AGCACCATCA 
6001 AACTTACCAT GGCTTCAGTA GTAAAACCTG GCTCTTACAC ACCTCTCACC 
6051 TTCCCCTCAA TAGAGACCCA CATTCATGTA tCAACAGCCA GAATGGCTTA 
6101 CTCTTCTGGG TCTTCACCTG AGATGACAGC TCCTGGAGAG ACTAACACTG 
6151 GTAGTACCTG GGACCCCACC ACCTACATCA CCACTACGGA TCCTAAGGAT 
6201 ACAAGTTCAG CTCAGGTCTC TACACCCCAC TCAGTGAGGA CACTCAGAAC 
6251 CACAGAAAAC CATCCAAAGA CAGAGTCCGC CACCCCAGCT GCTTACTCTG 
6301 GAAGTCCTAA AATCTCAAGT TCACCCAATC TCACCAGTCC GGCCACAAAA 
6351 GCATGGACCA TCACAGACAC AACTGAACAC TCCACTCAAT TACATTACAC 
6401 AAAATTGGCA GAAAAATCAT CTGGATTTGA GACACAGTCA GCTCCAGGAC 
6451 CTGTCTCTGT AGTAATCCCT ACCTCCCCTA CCATTGGAAG CAGCACATTG 
6501 GAACTAACTT CTGATGTCCC AGGGGAACCC CTGGTCCTTG CTCCCAGTGA 
6551 GCAGACCACA ATCACTCTCC CCATGGCAAC ATGGCTGAGT ACCAGTTTGA 
6601 CAGAGGAAAT GGCTTO^CA GACCTTGATA TTTCAAGTCC AAGTTCACCC 
6651 ATGAGTACAT TTGCTATTTT TCCACCTATG TCCACACCTT CTCATGAACT 
6701 TTCAAAGTCA QAGGCAGATA CCAGTGCCAT TAGAAATACA GATTCAACAA 
6751 CGTTGGATCA GCACCTAGGA ATCAGGAGTT TGGGCAGAAC TGGGGACTTA 
6801 ACAACTGTTC CTATCACCCC ACTGACAACC ACGTGGACCA GTGTGATTGA 
6851 ACACTCAACA CAAGCACAGG ACACCCTTTC TGCAACGATG AGTCCTACTC 
6901 ACGTGACACA GTCACTCAAA GATC2\AACAT CTATACCAGC CTCAGCATCC 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

6951 CCTTCCCATC TTACTGAAGT CTACCCTGAG CTCGGGACAC AAGGGAGAAG 
7001 CTCCTCTGAG GCAACCACTT TTTGGAAACC ATCTACAGAC ACACTGTCCA 
7051 GAGAGATTGA GACTGGCCCA ACAAACATTC AATCCACTCC ACCCATGGAC 
7101 AACACAACAA CAGGGAGCAG TAGTAGTGGA GTCACCCTGG GCATAGCCCA 
7151 CCTTCCCATA GGAACATCCT CCCCAGCTGA GACATCCACA AACATGGCAC 
7201 TGGAAAGAAG AAGTTCTACA GCCACTGTCT CTATGGCTGG GACAATGGGA 
7251 CTCCTTGTTA CTAGTGCTCC AGGAAGAAGC ATCAGCCAGT CATTAGGAAG 
7301 AGTTTCCTCT GTCCTTTCTG AGTCAACTAC TGAAGGAGTC AGAQATTCTA 
7351 GTAAGGGAAG CAGCCCAAGG CTGAACACAC AGGGAAATAC AGCTCTCTCC 
7401 TCCTCTCTTG AACCCAGCTA TGCTGAAGGA AGCCAGATGA GCACAAGCAT 
7451 CCCTCTAACC TCATCTCCTA CAACTCCTGA TGTGGAATTC ATAC3GGGGrA 
7501 GCACATTTTG GACCSiAGQAG GTCACCACA6 TTATGACCTC AGACATCTCC 
7551 AAGTCTTCAG CAAGGACAGA GTCCAGCTCA GCTACCCTTA TGTCCACAGC 
7601 TTTGGGAAGC ACTGAAAATA CAGGAAAAGA AAAACTCAGA ACTGCCTCTA 
7651 TGGATCTTCC ATCTCCAACT CCATCAATGG AG6TGACACC ATGGATTTCT 
7701 CTCACTCTCA GTAATGCCCC CAATACCACA GATTCACTTG ACCTCAGCCA 
7751 TGGGGTGCAC ACCAGCTCTG CAGGGACTTT GGCCACTGAC AGGTCATTGA 
7801 ATACTGGTGT CACTAGAGCC TCCAGATTGG AAAACGGCTC TGATACCTCT 
7851 TCTAAGTCCC TGTCTATGGG AAACAGCACT CACACTTCCA TGACTTACAC 
7901 AGAGAAQAGT GAAGTGTCTT CTTCAATCCA TCCCCGACCT GAGACCTCAG 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

7951 CTCCTGGAGC AGAGACCACT TTGACTTCCA CTCCTGGAAA CAGGGCCATA 
8001 AGCTTAACAT TGCCTTTTTC ATCCATTCCA GTGGAAGAAG TCATTTCTAC 
8051 AGGCATAACC TCAGGACCAG ACATCAACTC AGCACCCATG ACACATTCTC 
8101 CCATCACCCC ACCAACAATT GTATGGACCA GTACAGGCAC AATTGAACAG 
8151 TCCACTCAAC CACTACATGC AGTTTCTTCA GAAAAAGTTT CTGTGCAGAC 
8201 ACAGTCAACT CCATATGTCA ACTCTGTGGC AGTGTCTGCT TCCCCTACCC 
8251 ATGAGAATTC AGTCTCTTCT GGAAGCAGCA CATCCTCTCC ATATTCCTCA 
8301 GCCTCACTTG AATCCTTGGA TTCCACAATC AGTAGGAGGA ATGCAATCAC 
8351 TTCCTGGCTA TGGGACCTCA CTACATCTCT CCCCACTACA ACTTGGCCT^ 
8401 GTACTAGTTT ATCTGAGGCA CTGTCCTCAG GCCATTCTGG GGTTTCAAAC 
8451 CCAAGTTCAA CTACGACTGA ATTTCCACTC TTTTCAGCTG CATCCACATC 
8501 TGCTGCTAAG CAAAGAAATC CAGAAACAGA GACCCATGGT CCCCAGAATA 
8551 CAGCC6CGAG TACTTTGAAC ACTGATGCAT CCTCGGTCAC AGGTCTTTCT 
8601 GAGACTCCTG TGGGGGCAAG TATCAGCTCT GAAGTCCCTC TTCCAATGGC 
8651 CATAACTTCT AGATCAGATG TTTCTGGCCT TACATCTGAG AGTACTGCTA 
8701 ACCCGAGTTT AGGCACAGCC TCTTCAGCAG GGACCAAATT AACTAGGACA 
8751 ATATCCCTGC CCACTTCAGA GTCTTTGGTT TCCTTTAGAA TGAACAAGGA 
8801 TCCATGGACA GTGTCAATCC CTTTGGGGTC CCATCCAACT ACTAATACAG 
8851 AAACAAGCAT CCCAGTAAAC AGCGCAGGTC CACCTGGCTT GTCCACAGTA 
8901 GCATCAGATG TAATTGACAC ACCTTCAGAT GGGGCTGAGA GTATTCCCAC 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

8951 TGTCTCCTTT TCCCCCTCCC CTGATACTGA AGTGACAACT ATCTCACATT 
9001 TCCCAGAAAA GACAACTCAT TCATTTAGAA CCATTTGATC TCTCACTCAT 
9051 GAGTTGACTT CAAGAGTGAC ACCTATTCCT GGGGATTGGA TGAGTTCAGC 
9101 TATGTCTACA AAGCCCACAG GAGCCAGTCC CTCCATTACA CTGGGAGAGA 
9151 GAAGGACAAT CACCTCTGCT GCTCCAACCA CTTCCCCCAT AGTTCTCACT 
9201 GCTAGTTTCA CAGAGACCAG CACAGTTTCA CTGGATAATG AAACTACAGT 
9251 AAAAACCTCA GATATCCTTG ACGCACGGAA AACAAATGAG CTCCCCTCAG 
9301 ATAGCAGTTC TTCTTCTGAT CTGATCAACA CCTCCATAGC TTCTTCAACT 
9351 ATGGATGTCA CTAAAACAGC CTCCATCAGT CCCACTAGCA TCTCAGGAAT 
9401 GACAGCAAGT TCCTCCCCAT CTCTCTTCTC TTCAGATAGA CCCCAGGTTC 
9451 CCACATCTAC AACAGAGACA AATACAGCCA CCTCTCCATC TGTTTCCAGT 
9501 AACACCTATT CTCTTGATGG GGGCTCCAAT GTGGGTGGCA CTCCATCCAC 
9551 TTTACCACCC TTTACAATCA CCCACCCTGT CGAGACAAGC TCGGCCCTAT 
9601 TAGCCTGGTC TAGACCAGTA AGAACTTTCA GCACCATGGT CAGCACTGAC 
9651 ACTGCCTCCG GAGAAAATCC TACCTCTAGC AATTCTGTGG TGACTTCTGT 
9701 TCCAGCACCA GGTACATGGA CCAGTGTAGG CAGTACTACT GACTTACCTG 
9751 CCATGGGCTT TCTCAAGACA AGTCCTGCAG GAGAGGCACA CTCACTTCTA 
9801 GCATCAACTA TTGAACCAGC CACTGCCTTC ACTCCCCATC TCTCAGCAGC 
9851 AGTGGTCACT GGATCCAGTG CTACATCAGA AGCCAGTCTT CTCACTACX3A 
9901 GTGAAAGCAA AGCCATTCAT TCTTCACCAC AGACCCCAAC TACACCCACC 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

9951 TCTGGAGCAA ACTGGGAAAC TTCAGCTACT CCTGAGAGCC TTTTGGTAGT 
10001 CACTGAGACT TCAGACACAA CACTTACCTC AAAGATTTTG GTCACAGATA 
10051 CCATCTTGTT TTCAACTGTG TCCACGCCAC CTTCTAAATT TCCAAGTACG 
10101 GGGACTCTGT CTGGAGCTTC CTTCCCTACT TTACTCCCGG ACACTCCAGC 
10151 CATCCCTCTC ACTGCCACTG AGCCAACAAG TTCATTAGCT ACATCCTTTG 
10201 ATTCCACCCC ACTGGTGACT ATAGCTTCGG ATAGTCTTGG CACAGTCCCA 
10251 GAGACTACCC TGACCATGTC AGAGACCTCA AATGGTGATG CACTGGTTCT 
10301 TAAGACAGTA AGTAACCCAG ATAGGAGCAT CCCTGGAATC ACTATCCAAG 
10351 GAGT7VACAGA AAGTCCACTC CATCCTTCTT CCACTTCCCC CTCTAAGATT 
10401 GTTGCTCCAC GGAATACAAC CTATGAAGGT TCGATCACAG TGGCACTTTC 
10451 TACTTTGCCT GCGGGAACTA CTGGTTCCCT TGTATTCAGT CAGAGTTCTG 
10501 AAAACTCAGA GACAACGGCT TTGGTAGACT -^TCAGCrGG GCTTGAGAGG 
10551 GCATCTGTGA TGCCACTAAC CACAGGAAGC CAGGGTATGG CTAGCTCTGG 
10601 AGGAATCAGA AGTGGGTCCA CTCACTCAAC TGGAACCAAA ACATTTTCTT 
10651 CTCTCCCTCT GACCATGAAC CCAGGTGAGG TTACAGCCAT GTCTGAT^TC 
10701 ACCACGAACA GACTGACAGC TACTCAATCA ACAGCACCCA AAGGGATACC 
10751 TGTGAAGCCC ACCAGTGCTG AGTCAGGCCT CCTAACACCT GTCTCTGCCT 
10801 CCTCT^AGCCC ATCAAAGGCC TTTGCCTCAC TGACTACAGC TCCCCCAACT 
10851 TGGGGGATCC CACAGTCTAC CTTGACATTT GAGTTTTCTG AGGTCCCAAG 
10901 TTTGGATACT AAGTCCGCTT CTTTACC7VAC TCCTGGACAG TCCCTGAACA 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

10951 CCATTCCAGA CTCAGATGCA AGCACAGCAT CTTCCTCACT GTCCAAGTCT 
11001 CCAGAAAAAA ACCCAAGGGC AAGGATGATG ACTTCCACAA AGGCCATAAG 
11051 TGCAAGCTCA TTTCAATCAA CAGGTTTTAC TGAAACCCCT GAGGGATCTG 
11101 CCTCCCCTTC TATGGCAGGG CATGAACCCA GAGTCCCCAC TTCAGGAACA 
11151 GGGGACCCTA GATATGCCTC AGAGAGCATG TCTTATCCAG ACCCAAGCAA 
11201 GGCATCATCA GCTATGACAT CGACCTCTCT TGCATCAAAA CTCACAACTC 
11251 TCTTCAGCAC AGGTCAAGCA GCAAGGTCTG GTTCTAGTTC CTCTCCCATA 
11301 AGCCTATCCA CTGAGAAAGA AACAAGCTTC CTTTCCCCCA CTGCATCCAC 
11351 CTCCAGAAAG ACTTCACTAT TTCTTGGGCC TTCCATGGCA AGGCAGCCCA 



[0054] 11401 ACATATTGGT GCATCTTCAG ACTTCAGCTC TGACACTTTC TCCAACATCC 

11451 ACTCTAAATA TGTCCCAGGA GGAGCCTCCT GAGTTAACCT CAAGCCAGAC 

11501 CATTQCAGAA GAAGAGGGAA CAACAGCTGA AACACAGACG TTAACCTTCA 

11551 CACCATCTGA GACCCCAACA TCCTTGTTAC CTGTCTCTTC TCCCACAGAA 

11601 CCCACAGCCA GAAGAaAGAG TTCTCCAGAA ACATGGGCAA GCTCTATTTC 

11651 AGTTCCTCCC AAGACCTCCT TGGTTGAAAG TAAGAATGCC CTGCTCCTTC 

11701 CCCAAGTGTG CTGGGGATGA ATCTGQAAAT AAACTACATC TTTTTTATTT 

11751 TTTAAACTTT TATATTTGAA AATATAAATA TTTTAGGTTC AGGGAACATG 

11801 TGCAGGTTTG TTATATAGGT AAATTGCATG TCATGGGGGC TTGGGGTACA 

11851 GATTACATCA TCAGCCAGGT AATAAGCCTA GTACCTGATC AGTAGATTTT 

11901 TTTTAATCCT CTCCCTCCTC CCAGCCTCCA CCCTCAATTC ACATGTCTCC 



WO 2004/045553 

PCT/US2003/037041 





Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



11951 ATGTGTACTC AAGGTTTAAT TCCCACTTAT GAGTGAGAAC ATGCGGTATT 

12001 TGTAAACTAC ATCTTTATTT TTGCTAACCT CGAACTGAAA TTTAGCATTT 

12051 GTTTTATTGA TGAATAGAG6 TAACAAAACA AACCACATTA ATCCTAGCAG 

12101 TGCCTGTGCC TTTGCCAACA ACAGAAATTC CGGACACTTT CATATCCTAT 

12151 GACAATTGTT GCAAGCACTT TTAAAAATCA TGTACGACTT TATTCATAAT 

12201 TATAGTGGTT ATTAGGCTTT TCAATAGATC TTATTTAATG AGTTAGTAAA 

12251 ATAAGTGCCT GTATTATTGT ATTACATTTG TTTATTAAGA TCTTGATAAC 

12301 AACATTTCAA TATAATCATT TCCTTTGTTT TTTAAATTTT* AGATTCAGGG 

12351 GTATATGTGC AGGTTTGTTA CGTGGATATA CTGCATAATG ATGAGGTTTG 

12401 GCTTCTAGTG AACCCATCAG CCAAATAGTG AATGTTGTGC CCAATAAGTA 

12451 GTTTTTCAAT CCTCACTTCA CTCCCAGCCT CCTCTATTTT GGAGTCCCAG 

12501 TGTCTATTAT TTCTATCTTT ATGTCCACAT GTACCCATTG GTTAGCTCCC 

12551 ACTTATAAGT GAGAATGTGC AGTATTTAAT TTTCTGTTTT TGAGTTATTT 

12601 TGCTTAGGTT GATGGCCTTC AGCTCCAGCC ACGTTGCTTT AAAGAACATG 

12651 ATTTCATTCT TTTTTATGGC TGCATAGTAC TCCGAGGTGT ATGTGTACCA 

12701 GATTTTCTTT ATCCACAATG ATTTCCTTTG TAATCTAATA TTTTATATTG 

12751 TTATTTTATG TTTTATTCTA TATTTTTATT TTAATTTATA AAGGAATTCA 

12801 TATGGTTCAC AAGCCTGTCA AAGGGACCTA TAATAAAAAG AGGTTAAGAA 

12851 TCCATGCTCT AAACAGAATA TTACTCCATT TTATTTCATT TATTTTTAAA 

12901 GAGACAGTCT CACTCTGTCA TCCAGGCTGG AGTACAGTGG AGTGATCATA 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



12951 GCTCATTGCA ACCCTGAACT CTTGGGCACA AGCAATTCTC CTGCTTCATC 

13001 CTCCAGAGGA GCTGG6ACTA CAGGTGCACA TCACCATGCC CAGCTAGTTT 

13051 TAAAAATTAT TTTGTAGAGA TGGTGTCTCA CTATCCTACC CAGGCTGGTC 

13101 TCAAACTCCT GGGCTCAGGC AATCCTCCCA CTTTGACCTC CCAAAGTGTT 

13151 GAGATTACAG GGGCAAGCCA CTGTGCCTGG CCACTTGTCA CATTTTAATT 

13201 TGTGATTACT TATAAAATGA ACCCCTTCCC ATCTGAGATC TGTCAGTCTT 

13251 TCTGGTGACG GTGCCTGGTG TCTGCTTTCT ACCATGTCCT GTTAGACTAG 

13301 TGTTTGATGG GAGGTCACCT GGGCAGCTGT CCAGCTCACT CACTGGGCTC 

13351 TAGAGCCTCT GAGTTGAAGC AAAATAGAAA GATCAGTCAA TGTAAAGAAA 

13401 GCTCAAAAAC TGACATTCTG AAGTAATGGA TAGCTAAACC TTCCTATTGC 
Exon 2 

13451 CCTTTTCTTT CA GCAACTGA TGGAACGCTA GTGACCACCA TAAAGATGTC 
13501 AAGCCAGGC A GCACTVAGGAA ATTCCACGTG GCCTGCCCCA GCAGAGGA6A 

13551 CGGGGACC AG TCCAGCAGGT AAATATAGAC CTTGTTTCCA TTTCTGCTCT 

13601 GCTAATGCCA CCCAAGCCTT TCTTTTCTTT TCTTTTCTTT TCTTTTCTTT 

13651 TCTTTTCTTT TCTTTTCTTT CTCTCCCTTT CTTTCTTTCT TTCTTTCTTT 

13701 CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT CTTTCTTTCT TTCTTTCTTT 

13751 CTTTCTTTCT TTCTTTCTTT CTTTCTTTCT CTTTCTTTCT TCTTTCTCTC 

13801 TCTCTCTTTC TTTCTTTCTC TTGTTCTTTT TAAATTTTTT ATTTTTTTAC 

13851 TTAATTTTTT TCACCCAAGC CTTAAGGCCA GTTTGGACCA GATAGTGAGA 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

13901 CCCCACCTCT ATAAAAAAAA TTTTTAAAAA AAAAATAAGT TGGGCATCGT 
13951 GCAGGCCTGT AGTCCCTGCT ACTCGAGAGG CCAAGGTGGG AGGACAGCTT 
14001 GCTGCTGACT AAAAGTGCTG CTTATTGATT CTGGGAAGAA AAAATATACA 
14051 AGGCTTCAGT TTCATTATTT TATAAGTAAA TGCTAGCAAC TTTTCCTTTC 
14101 TTTCTCTCTT TCTCTCTTCC TCTCTTTCTC TCCTCTCCTT CTCTTCTCTC 
14151 TCTCTCTCTC TCTCTCTCTC TTTCTCTCTC CTCTCCTTCT CTTCTCTTCT 
14201 TTCTCTCTCT CTCTCTTTCA TTTATTTTTG AGACATGGTC TCATTCTGTC 
14251 ACCCAGGCTG GAGTACAGTG GTGTATATTT ACTGCAGTAC TCACTGTACT 
14301 CACTGCAGCC TCAAATTCCT GGGCTCAAGC TATCCTCTCA CCTCAGCCTC 
14351 CTGAGTAGCT GGGCAGCAGT CCAGCTCACT CACTGGGCTC TAGAGCCTCT 
14401 GTGCTATGCC CAGCTTATTG TTGTTGTTTT TTTAAATTTT TTTTTTTGTA 
14451 CAGATGGGGT CTCACTATGT GGCCCAAGGT GGTCTTAAAC TCCTGGCTCC 
14501 AAGAGATCCT CCCACCTCAG CCTCCCAAAG TGCAGGGATT ACAGGTGTGA 
14551 GCCACTGTGC CCAGCCTAGA CAGCATTTTT TTTTTTTGAA ACAGGGTCTC 
14601 CCTCTGTTGC CCAGGCTGGA GTGCAATGGC GTGTTCATGG TTCACTGCAG 
14651 CCTCAGCCTC CTCAGTCTCA AGCAATCCTC CAACTTCAGC CTCCCCCAAC 
14701 AGCTAGAACT GCAGGTGATC ATCACCAATT AGCCTGGTTA ATTGTGTGTG 
14751 TATTTCTTAA ATTTTTTGTA GAGATAGTTC TCACTATATT GCTTGGGCTG 
14801 GTCTCAAACT CCTGGACTCA AGTGATTCAC CTACCTCGGC CTCCCTAAGC 
14851 ACTGGGATTA CAGGCTTGAG CCACCACACC CGGCAAGGAC TAGGTTTTAA 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



14901 AATAGGTTCC TAGGCTGGGT GTGGTC3GCTT ACGCCCGTAA TCCCAGCACT 

14951 TTGGGAGGCT GAGGTGGGCG GATCACGRGG TCaWSGAGTTT GAGACCAGCC 

15001 TGGCCAACAT AGTGAAACCC TGTCTCTACT AAAAATACAA AAAATTAGCT 

15051 GGGCATAGTG GCACACACCT GTAATCCCAG CTACTCGGGA GGCTGAGGAA 

15101 GGAGAATCAC TTGAACCTGG GAGGCGQAGG TTGCAGTGAG CCX3AGATCAT 

15151 GCCATTGCTC TCCAGCCTGG GTGACAGAGC AAGACTCCAT CTAAAAAAAA 

15201 AAAAAAAAGT TCCTTTGACT TCTTGACACT CTTCTCTGAG GATATTGATC 

15251 ATTTTTCCCC AATAGATGTT ACTAATT6AA CACTTCT6TT GCTTCAACTT 

15301 ACTAATTTAC ATGATCAATA GCCAATTAAT TCAGCAGQAG AGAATGCTAC 

15351 AGAGTCGATT CTTTCTGTAC TTTCTTCTGC TCCA6AGTGA AGGATCTTTC 

15401 TAAATCAGAG ACCATCACTG TGTTCACAGG GAGGGCCTAQ GTGAACCTQA 

15451 GATGGCaAAT GTTGCGTTTG TTCTACGQAA GAAGGGATTA TGGGTTGAAG 

15501 TCCTTGGCAG TGCCAAATTG CTTAGAAAAA TGTGAAATAT GGTCCCTAGG 

15551 AGTGCTCTTG GGATGTCACA TTTTTCTCAC TCCTTTGACA GGTAGATGTT 

15601 ATTTTCCTGA AGGCCAGGQA AAGGATTCAG AGGGAGQAAT 6AATTTGAAA 

15651 GAAAATGAAG GTGACX3AGAA AQAATGAGCT CATCTCCCTT ATCCTCTTTC 

15701 TTCTCAAATC CTTAAGTAGC TTTGCAGTGA ACTAAGATTT GGGGGAACCT 

15751 AGAGQAGGCT GAAAGTTGGA AGCTGAAATT GGCTTAGCAA GGGCAAGCTC 

15801 CAAA6ACAAA AGTGGAAATA GTTTGGGGGT AGCCTTTTGC ATGGGTGAAA 

15851 TCCTGGTTCA TCACATCCTC CCTTATGCAA AGAGCCCTTT TATATGGGGC 




Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

15901 ATGGGGAAAA ACTGAGCTAA AGGTGATAAT TTCTCCTGAG CAAGCCAGAT 
15951 GGTCAAAGCT CTAACTTCAC CATCTCCCTT GGAATGTTTA ATGTGTTCCC 
16001 TGGTGTCCAG AGGCTTAACG TGTGAGAATT AAAAGCTCAA CATTTTCTTT 
16051 CCCAGGGAAG GAGGAAATAG TTTTAATTGA AATCCCGGGA GGAAATGAAT 
16101 GATAGTGTCA AACCAAAAAA CTTCATCTTC TGTACCACTT GCATATACTC 
Exon 3 
16151 CACTGACTTA CTTTCTAATC ACAGG CACAT CCCCAGGAAG CCCAGAAATG 
16201 TCTACCACTC TCAAAATCAT GAGCTCCAAG GAACCCGGCA TCAGCCCAGA 
16251 GATCAGQTCC ACTGTGAGAA ATTCTCCTTG GA/VGACTCCA GAAACAACTG 
16301 TTCCCATGGA GACCACAGTG GAACCAGTCA CCCTTCAGTC CACAGCCCTA 
16351 GGAAGTGGCA GCACCAGCAT CTCTCACCTG CCCACAGGAA CCACATCACC 
16401 AACCAAGTCA CCAACAGAAA ATATGTTGGC TACAGAAAGG GTCTCCCTCT 
16451 CCCCATCCCC ACCTGAGGCT TGGACCAACC TTTATTCTGG AACTCCAGGA 
16501 GGGACCAGGC AGTCACTGGC CACAATGTCC TCTGTCTCCC TAGAGTCACC 
16551 AACTGCTAGA AGCATCACAG GGACTGGTCA GCAAAGCAGT CCAGAACTGG 
16601 TTTTAAAGAC AACTGGAATG GAATTCTCTA TGTGGCATGG CTCTACTGGA 
16651 GGGACCACAG GGGACACACA TGTCTCTCTG AGCACATCTT CCAATATCCT 
16701 TGAAGACCCT GTAACCAGCC CAAACTCTGT GAGCTCATTG ACAGATAAAT 
16751 CCAAACATAA AACCGAGACA TGGGTCAGCA CCACAGCCAT TCCCTCCACT 
16801 GTCCTGAATA ATAAGATAAT GGCAGCTGAA CAACAGACAA GTCGATCTGT 
Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 

16851 GGATGAGGCT TATTCATCAA CTAGTTCTTG GTCAGATCAG ACATCTGGGA 
16901 GTGACATCAC CCTTGGTGCA TCTCCTGATG TCACAAACAC ATTATACATC 
16951 ACCTCCACAG CACAAACCAC CTCACTAGTA TCTCTGCCCT CTGGAGACCA 
17001 AGGCATTACA AGCCTCACCA ATCCCTCAGG AGGAAAAACA AGCTCTGCAT 
17051 CATCTGTCAC ATCTCCTTCA ATAGGGCTTG AGACTCTGAT GGCCAATGTA 
17101 AGTGCAGTGA CAAGTGACAT TGCCCCTACT GCTGGGCATC TATCTCAGAC 
17151 TTCATCTCCT GCGGAAGTGA GCATCCTGGA CATAACCACA GCTCCTACTC 
17201 CAGGTATCTC CACCACCATC ACCACCATGG GAACCAACTC AATCTCAACT 
17251 ACCACACCCA ACCCAGAAGT GGGTATGAGT ACCATGGACA GCACCCCGGC 
17301 CACAGAGAGG CACACAACTT CTACAGAACA CCCTTCCACC TGGTCTTCCA 
17351 CAGCTGCATC AGATTCCTGG ACTGTCACAG ACATGACTTC AAACTTGAAA 
17401 GTTGCAAGAT CTCCTGGAAC AATTTCCACA ATGCATACAA CTTCATTCTT 
17451 AGCCTCAAGC ACTGAATTAG ACTCCATGTC TACTCCCCAT GGCCGTATAA 
17501 CTGTCATTGG AACCAGCCTG GTCACTCCAT CCTCTGATGC TTCAGCTGTA 
17551 AAGACAGAGA CCAGTACAAG TGAAAGAACA TTGAGTCCTT CAGACACAAC 
17601 TGCATCTACT CCCATCTCAA CTTTTTCTCG TGTCCAGAGG ATGAGCATCT 
17651 CAGTTCCTGA CATTTTAAGT ACAAGTTGGA CTCCCAGTAG TACAGAAGCA 
17701 GAAGATGTGC CTGTTTCAAT GGTTTCTACA GATCATGCTA GTACAAAGAC 
17751 TGACCCAAAT ATGCCCCTGT CCACTTTTCT GTTTGATTCT CTGTCCACTC 
17801 TTGACTGGGA CACTGGGAGA TCTCTGTCAT CAGCCACAGC CACTACCTCA 
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



17851 GCTCCTCAGG GGGCCACAAC TCCCCAAGAA CTCACTTTGG AAACCATGAT
17901 CAGCCCAGCT ACCTCACAGT TGCCCTTCTC TATAGGGCAC ATTACAAGTG
17951 CAGTCATACC AGCTGCAATG GCAAGGAGCT CTGGAGTTAC TTTTTCAAGA
18001 CCAGATCCCA CAAGCAAAAA GGCAGAGCAG ACTTCCACTC AGCTTCCCAC
18051 CACCACTTCT GCACATCCAG AGCAGGTGCC CAGATCAGCA GCAACAACTC
18101 TGGATGTGAT CCCACACACA GCAAAAACTC CAGATGCAAC TTTTCAGAGA
18151 CAAGGGCAGA CAGCTCTTAC AACAGAGGCA AGAGCTACAT CTGACTCCTG
18201 GAATGAGAAA GAAAAATCAA CCCCAAGTGC ACCTTGGATC ACTGAGATGA
18251 TGAATTCTGT CTCAGAAGAT ACCATCAAGG AGGTTACCAG CTCCTCCAGT
18301 GTGTTAAGGA CCCTGAATAC GCTGGACATA AACTTGGAAT CTGGGACGAC
18351 TTCATCCCCA AGTTGGAAAA GCAGCCCATA TGAGAGAATT GCCCCTTCTG
18401 AGTCTACCAC AGACAAAGAG GCAATTCACC ('"TCTACAAA CACAGTAGAG
18451 ACCACTGGCT GGGTCACAAG TTCCGAACAT GCTTCTCATT CCACTATCCC
18501 AGCCCACTCA GCGTCATCCA AACTCACATC TCCAGTGGTT ACAACCTCCA
18551 CCAGGGAACA AGCAATAGTT TCTATGTCAA CAACCACATG GCCAGAGTCT
18601 ACAAGGGCTA GAACAGAGCC TAATTCCTTC TTGACTATTG AACTGAGGGA
18651 CGTCAGCCCT TACATGGACA CCAGCTCAAC CACACAAACA AGTTTTATCT
18701 CTTCCCCAGG TTCCACTGCG ATCACCAAGG GGCCTAGAAC AGAAATTACC
18751 TCCTCTAAGA GAATATCCAG CTCATTCCTT GCCCAGTCTA TGAGGTCGTC
18801 AGACAGCCCC TCAGAAGCCA TCTCCAGGCT GTCTAACTTT CCTGCCATGA

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



18851 CAGAATCTGG AGQRATGATC CTTGCTATGC AAACAAGTCC ACCTGGCGCT
18901 ACATCACTAA GTGCACCTAC TTTGGATACA TCAGCCACAG CCTCCTGGAC
18951 AGGGACTCCA CTGGCTACGA CTCAGAGATT TACATACTCA GAGAAGACCA
19001 CTCTCTTTAG CAAAGGTCCT GAGGATACAT CACAGCCAAG CCCTCCCTCT
19051 GTGGAAGAAA CCAGCTCTTC CTCTTCCCTG GTACCTATCA ATGCTACAAC
19101 CTCGCCTTCC AATATTTTGT TGACATCACA AGGGCACAGT CCCTCCTCTA
19151 CTCCACCTGT GACCTCAGTT TTCTTGTCTG AGACCTCTGG CCTGGGGAAG
19201 ACCACAGACA TGTCGAGGAT AAGCTTGGAA CCTGGCACAA GTTTACCTCC
19251 CAATTTGAGC AGTACAGCAG GTGAGGCGTT ATCCACTTAT GAAGCCTCCA
19301 GAGATACAAA GGCAATTCAT CATTCTGCAG ACACAGCAGT GACGAATATG
19351 GAGGCAACCA GTTCTGAATA TTCTCCTATC CCAGGCCATA CAAAGCCATC
19401 CAAAGCCACA TCTCCATTGG TTACCTCCCA CATCATGGGG GACATCACTT
19451 CTTCCACATC AGTATTTGGC TCCTCCGAGA CCACAGAGAT TGAGACAGTG
19501 TCCTCTGTGA ACCAGGGACT TCAGGAGAGA AGCACATCCC AGGTGGCCAG
19551 CTCTGCTACA GAGACAAGCA CTGTCATTAC CCATGTGTCT AGTGGTGATG
19601 CTACTACTCA TGTCACCAAG ACACAAGCCA CTTTCTCTAG CX3GAACATCC
19651 ATCTCAAGCC CTCATCAGTT TATAACTTCT ACCAACACAT TTACAGATGT
19701 GAGCACCAAC CCCTCCACCT CTCTGATAAT GACAGAATCT TCAGGAGTGA
19751 CCATCACCAC CCAAACAGGT CCTACTGGAG CTGCAACACA GGGTCCATAT
19801 CTCTTGGACA CATCAACCAT GCCTTACTTG ACAGAGACTC CATTAGCTGT

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



19851 GACTCCAGAT TTTATGCAAT CAGAGAAGAC CACTCTCATA AGCAAAGGTC
19901 CCAAGGATGT GTCCTGGACA AGCCCTCCCT CTGTGGCAGA AACCAGCTAT
19951 CCCTCTTCCC TGACACCTTT CTTGGTCACA ACCATACCTC CTGCCACTTC
20001 CACGTTACAA GGGCAACATA CATCCTCTCC TGTTTCTGCG ACTTCAGTTC
20051 TTACCTCTGG ACTGGTGAAG ACCACAGATA TGTTGAACAC AAGCATGGAA
20101 CCTGTGACCA ATTCACCTCA AAATTTGAAC AATCCATCAA ATGAGATACT
20151 GGCCACTTTG GCAGCCACCA CAGATATAGA GACTATTCAT CCTTCCATAA
20201 ACAAAGCAGT GACCAATATG GGGACTGCCA GTTCAGCACA TGTACTGCAT
20251 TCCACTCTCC CAGTCAGCTC AGAACCATCT ACAGCCACAT CTCCAATGGT
20301 TCCTGCCTCC AGCATGGGGG ACGCTCTTGC TTCTATATCA ATACCTGGTT
20351 CTGAGACCAC AGACATTGAG GGAGAGCCAA CATCCTCCCT GACTGCTGGA
20401 CGAAAAGAGA ACAGCACCCT CCAGGAGATG AACTCAACTA CAGAGTCAAA
20451 CATCATCCTC TCCAATGTGT CTGTGGGGGC TATTACTGAA GCCACAAAAA
20501 TGGAAGTCCC CTCTTTTGAT GCAACATTCA TACCAACTCC TGCTCAGTCA
20551 ACAAAGTTCC CAGATATTTT CTCAGTAGCC AGCAGTAGAC TTTCAAACTC
20601 TCCTCCCATG ACAATATCTA CCCACATGAC CACCACCCAG ACAGGGTCTT
20651 CTGGAGCTAC ATCAAAGATT CCACTTGCCT TAGACACATC AACCTTGGAA
20701 ACCTCAGCAG GGACTCCATC AGTGGTGACT GAGGGGTTTG CCCACTCAAA
20751 AATAACCACT GCAATGAACA ATGATGTCAA GGACGTGTCA CAGACAAACC
20801 CTCCCTTTCA GGATGAAGCC AGCTCTCCCT CTTCTCAAGC ACCTGTCCTT

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



20851 GTCACAACCT TACCTTCTTC TGTTGCTTTC ACACCGCAAT GGCACAGTAC
20901 CTCCTCTCCT GTTTCTATGT CCTCAGTTCT TACTTCTTCA CTGGTAAAGA
20951 CCGCTGGCAA GGTGGATACA AGCTTAGAAA CAGTGACCAG TTCACCTCAA
21051 AGATATAGAG ACAACGCATC CTTCCATAAA CACAGTAGTT ACCAATGTGG
21101 GGACCACCGG TTCAGCATTT GAATCACATT CTACTGTCTC AGCTTACCCA
21151 GAGCCATCTA AAGTCACATC TCCAAATGTT ACCACCTCGA CCATGGAAGA
21201 CTICCACAATT TCCAGATCAA TACCTAAATC CTCTAAGACT ACAAGAACTG
21251 AGACTGAGAC AACTTCCTCC CTGACTCCTA AACTGAGGGA GACCAGCGTC
21301 TCCCAGGAGA TCAGCTCGTC CACAGAGACA AGCACTGTTC CTTACAAAGA
21351 GCTCACTGGT GCCACTACCG AGGTATCCAG GACAGATGTC ACTTCCTCTA
21401 GCAGTACATC CTTCCCTGGC CCTGATCAGT CCACAGTGTC ACTAGACATC
21451 TCCACAGAAA CCAACACCAG GCTGTCTACC TCCCCAATAA TGACAGAATC
21501 TGCAGAAATA ACCATCACCA CCCAAACAGG TCCTCATGGG GCTACATCAC
21551 AGGATACTTT TACCATGGAC CCATCAAATA CAACCCCCCA GGCAGGGATC
21601 CACTCAGCTA TGACTCATGG ATTTTCACAA TTGGATGTGA CCACTCTTAT
21651 GAGCAGAATT CCACAGGATG TATCATGGAC AAGTCCTCCC TCTGTGGATA
21701 AAACCAGCTC CCCCTCTTCC TTTCTGCCCT CACCTGCAAT GACCACACCT
21751 TCCCTGATTT CTTCTACCTT ACCAGAGGAT AAGCTCTCCT CTCCTATGAC
21801 TTCACTTCTC ACCTCTGGCC TAGTGAAGAT TACAGACATA TTACGTACAC
21851 GCTTGGAACC TGTGACCAGC TCACTTCCAA ATTTCAGCAG CACCTCAGAT

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



21901 AAGATACTGG CCACTTCTAA AGACAGTAAA GACACAAAGG AAATTTTTCC
21951 TTCTATAAAC ACAGAAGAGA CCAATGTGAA AGCCAACAAC TCTGGACATG
22001 AATCCCATTC CCCTGCACTG GCTGACTCAG AGACACCCAA AGCCACAACT
22051 CAAATGGTTA TCACCACCAC TGTGGGAGAT CCAGCTCCTT CCACATCAAT
22101 GCCAGTGCAT GGTTCCTCTG AGACTACAAA CATTAAGAGA GAGCCAACAT
22151 ATTTCTTGAC TCCTAGACTG AGAGAGACCA GTACCTCTCA GGAGTCCAGC
22201 TTTCCCACGG ACACAAGTTT TCTACTTTCC AAAGTCCCCA CTGGTACTAT
22251 TACTGAGGTC TCCAGTACAG GGGTCATCTC TTCTAGCAAA ATTTCCACCC
22301 CAGACCATGA TAAGTCCACA GTGCCACCTG ACACCTTCAC AGGAGAGATC
22351 CCCAGGGTCT TCACCTCCTC TATTAAGACA AAATCTGCAG AAATGACGAT
22401 CACCACCCAA GCAAGTCCTC CTGAGTCTGC ATCGCACAGT ACCCTTCCCT
22451 TGGACACATC AACCACACTT TCCCAGGGAG GGACTCATTC AACTGTGACT
22501 CAGGGATTCC CATACTCAGA GGTGACCACT CTCATGGGCA TGGGTCCTGG
22551 GAATGTGTCA TGGATGACAA CTCCCCCTGT GGAAGAAACC AGCTCTGTGT
22601 CTTCCCTGAT GTCTTCACCT GCCATGACAT CCCCTTCTCC TGTTTCCTCC
22651 ACATCACCAC AGAGCATCCC CTCCTCTCCT CTTCCTGTGA CTGCACTTCC
22701 TACTTCTGTT CTGGTGACAA CCACAGATGT GTTGGGCACA ACAAGCCCAG
22751 AGTCTGTAAC CAGTTCACCT CCAAATTTGA GCAGCATCAC TCATGAGAGA
22801 CCGGCCACTT ACAAAGACAC TGCACACACA GAAGCCGCCA TGCATCATTC
22851 CACAAACACC GCAGTGACCA ATGTAGGGAC TTCCGGGTCT GGACATAAAT

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



22901 CACAATCCTC TGTCCTAGCT GACTCAGAGA CATCGAAAGC CACACCTCTG
22951 ATGAGTACCA CCTCCACCCT GGGGGACACA AGTGTTTCCA C31TCAACTCC
23001 TAATATCTCT CAGACTAACC AAATTCAAAC AGAGCCAACA GCATCCCTGA
23051 GCCCTAGACT GAGGGAGAGC AGCACGTCTG AGAAGACCAG CTCAACAACA
23101 GAGACAAATA CTGCCTTTTC TTATGTGCCC ACAGGTGCTA TTACTCAGGC
23151 CTCCAGAACA GAAATCTCCT CTAGCAGAAC ATCCATCTCA GACCTTGATC
23201 GGTCCACAAT AGCACCCGAC ATCTCCACAG GAATGATCAC CAGGCTCTTC
23251 ACCTCCCCCA TCATGACAAA ATCTGCAGAA ATGACCGTCA CCACTCS^C
23301 AACTACTCCT GGGGCTACAT CAC3M3GGTAT CCTTCCCTGG GACACATCAA
23351 CCACACTTTT CCAGGGAGGG ACTCATTCAA CCGTGTCTCA GGGATTCCCA
23401 GACTCAGAGA TAACCACTCT TCGGAGCAGA ACCCCTGGAG ATOTGTCATG
23451 GATGACAACT CCCCCTGTGG AAGAAACCAG CTCTGGGTTT TCCCTGATGT
23501 CACCTTCCAT GACATCCCCT TCTCCTGTTT CCTCCACATC ACCAGAGAGC
23551 ATCCCCTCCT CTCCTCTCCC TGTGACTGCA CTTCTTACTT CTGTTCTGGT
23601 GACAACCACA AATGTATTGG GCACAACAAG CCCAGAGCCC GTAACGAGTT
23651 CACCTCCAAA TTTAAGCAGC CCCACACAGG AGAGACTGAC CACTTACAAA
23701 GACACTGCGC ACACAGAAGC CATGCATGCT TCCATGCATA CAAACACTGC
23751 AGTGGCCAAC GTGGGGACCT CCATTTCTGG ACATGAATCA CAATCTTCTG
23801 TCCCAGCTGA TTCAGACACA TCCAAAGCCA CATCTCCAAT GGGTACCACC
23851 TTCGCCATGG GGGATACAAG TGTTTCTACA TCAACTCCTG CCTTCTTTGA

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



23901 GACTAGTIATT CAGACTGAAT CAACATCCTC TTTGATTCCT GGATTT^GGG
23951 ACACCAGGAC GTCTGAGGAG ATCAACACTG TGACAGAGAC CAGCACTGTC
24001 CTTTCAGAAG TGCCCACTAC TACTACTACT GAGGTCTCCA GGACAGAAGT
24051 TATCACTTCC AGCAGAACAA CCATCTCAGG GCCTGATCAT TCCAAAATGT
24101 CACCCTACAT CTCCACAGAA ACCATCACCA GGCTCTCCAC TTTTCCTTTT
24151 GTAACAGGAT CCACAGAAAT GGCCATCACC AACCAAACAG GTCCTATAGG
24201 GACTATCTCA CAGGCTACCC TTACCCTGGA CACATCAAGC ACAGCTTCCT
24251 GGGAAGGGAC TCACTCACCT GTGACTCAGA GATTTCCACA CTCAGAGGAG
24301 ACCACTACTA TGAGCAGAAG TACTAAGGGC GTGTCATGGC AAAGCCCTCC
24351 CTCTGTGGAA GAAACCAGTT CTCCTTCTTC CCCAGTGCCT TTACCTGCAA
24401 TAACCTCACA TTCATCTCTT TATTCCGCAG TATCAGGAAG TAGCCCCACT
24451 TCTGCTCTCC CTGTGACTTC CCTTCTCACC TCTGGCAGGA GGAAGACCAT
24501 AGACATGTTG GACACACACT CAGAACTTGT GACCAGCTCC TTACCAAGTG
24551 CAAGTAGCTT CTCAGGTGAG ATACTCACTT CTGAAGCCTC CACAAATACA
24601 GAGACAATTC ACTTTTCAGA GAACACAGCA GAAACCAATA TGGGGACCAC
24651 CAATTCTATG CATAAACTAC ATTCCTCTGT CTCAATCCAC TCCCAGCCAT
24701 CCGGACACAC ACCTCCAAAG GTTACTGGAT CTATGATGGA GGACGCTATT
24751 GTTTCCAC7VT CAACACCTGG TTCTCCTGAG ACTAAAAATG TTGACAGAGA
24801 CTCAACATCC CCTCTGACTC CTGAACTGAA AGAGGACAGC ACCGCCCTGG
24851 TGATGAACTC AACTACAGAG TCAAACACTG TTTTCTCCAG TGTGTCCCTG

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



24901 GATGCTGCTA CTGAGGTCTC CAGGGCAGAA GTCACCTACT ATGATCCTAC
24951 ATTCATGCCA GCTTCTGCTC AGTCAACAAA GTCCCCAGAC ATTTCACCTG
25001 AAGCCAGCAG CAGTCATTCT AACTCTCCTC CCTTGACAAT ATCTACACAC
25051 AAGACCATCG CCACACAAAC AGGTCCTTCT GGGGTGACAT CTCTTGGCCA
25101 ACTGACCCTG GACACATCAA CCATAGCCAC CTCAGCAGGA ACTCCATCAG
25151 CCAGAACTCA GGATTTTGTA GATTCAGAAA CAACCAGTGT CATGAACAAT
25201 GATCTCAATG ATGTGTTGAA GACAAGCCCT TTCTCTGCAG AAGAAGCCAA
25251 CTCTCTCTCT TCTCAGGCAC CTCTCCTTGT GACAACCTCA CCTTCTCCTG
25301 TAACTTCCAC ATTGCAAGAG CACAGTACCT CCTCTCTTGT TTCTGTGACC
25351 TCAGTACCCA CCCCTACACT GGCGAAGATC ACAGACATGG ACACAAACTT
25401 AGAACCTGTG ACTCGTTCAC CTCAAAATTT AAGGAACACC TTGGCCACTT
25451 CAGAAGCCAC CACAGATACA CACACAATGC ATCCTTCTAT AAACACAGCA
25501 GTGGCCAATG TGGGGACCAC CAGTTCACCA AATGAATTCT ATTTTACTGT
25551 CTCACCTGAC TCAGACCCAT ATAAAGCCAC ATCCGCAGTA GTTATCACTT
25601 CCACCTCGGG GGACTCAATA GTTTCCACAT CAATGCCTAG ATCCTCTGCG
25651 ATGAAAAAGA TTGAGTCTGA GACAACTTTC TCCCTGATAT TTAGACTGAG
25701 GGAGACTAGC ACCTCCCAGA AAATTGGCTC ATCCTCAGAC ACAAGCACGG
25751 TCTTTGACAA AGCATTCACT GCTGCTACTA CTGAGGTCTC CAGAACAGAA
25801 CTCACCTCCT CTAGCAGAAC ATCCATCCAA GGCACTGAAA AGCCCACAAT
25851 GTCACCGGAC ACCTCCACAA GATCTGTCAC CATGCTTTCT ACTTTTGCTG

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



25901 GCCTGACAAA ATCCGAAGAA AGGACCATTG CCACCCAAAC AGGTCCTCAT
25951 AGGGCGACAT CACAGGGTAC CCTTACCTGG GACACATCAA TCACAACCTC
26001 ACAGGCAGGG ACCCACTCAG CTATGACTCA TGGATTTTCA CAATTAGATT
26051 TGTCCACTCT TACGAGTAGA GTTCCTGAGT ACATATCAGG GACAAGCCCA
26101 CCCTCTGTGG AAAAAACCAG CTCTTCCTCT TCCCTTCTGT CTTTACCAGC
26151 AATAACCTCA CCGTCCCCTG TACCTACTAC ATTACCAGAA AGTAGGCCGT
26201 CTTCTCCTGT TCATCTGACT TCACTCCCCA CCTCTGGCCT AGTGAAGACC
26251 ACAGATATGC TGGCATCTGT GGCCAGTTTA CCTCCAAACT TGGGCAGCAC
26301 CTCACATAAG ATACCGACTA CTTCAGAAGA CATTAAAGAT ACAGAGAAAA
26351 TGTATCCTTC CACAAACATA GCAGTAACCA ATGTGGGGAC CACCACTTCT
26401 GAAAAGGAAT CTTATTCGTC TGTCCCAGCC TACTCAGAAC CACCCAAAGT
26451 CACCTCTCCA ATGGTTACCT CTTTCAACAT A^nGGACPZC ATTGTTTCCA
26501 CATCCATGCC TGGCTCCTCT GAGATTACAA GGATTGAGAT GGAGTCAACA
26551 TTCTCCCTGG CTCATGGGCT GAAGGGAACC AGCACCTCCC AGGACCCCAT
26601 CGTATCCACA GAGAAAAGTG CTGTCCTTCA CAAGTTGACC ACTGGTGCTA
26651 CTGAGACCTC TAGGACAGAA GTTGCCTCTT CTAGAAGAAC ATCCATTCCA
26701 GGCCCTGATC ATTCCACAGA GTCACCAGAC ATCTCCACTG AAGTGATCCC
26751 CAGCCTGCCT ATCTCCCTTG GCATTACAGA ATCTTCAAAT ATGACCATCA
26801 TCACTCGAAC AGGTCCTCCT CTTGGCTCTA CATCACAGGG CACATTTACC
26851 TTGGACACAC CAACTACATC CTCCAGGGCA GGAACACACT CGATGGCGAC

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



26901 TCAGGAATTT CCACACTCAG AAATGACCAC TGTCATGAAC AAGGACCCTG
26951 AGATTCTATC ATGGACAATC CCTCCTTCTA TAGAGAAAAC CAGCTTCTCC
27001 TCTTCCCTGA TGCCTTCACC AGCCATGACT TCACCTCCTG TTTCCTCAAC
27051 ATTACCj^G ACCATTCACA CCACTCCTTC TCCTATGACC TCACTGCTCA
27101 CCCCTAGCCT AGTGATGACC ACAGACACAT TGGGCACAAG CCCAGAACCT
27151 ACAACCAGTT CACCTCCAAA TTTGAGCAGT ACCTCACATG AGATACTGAC
27201 AACAGATGAA GACACCACAG CTATAGAAGC CATGCATCCT TCCACAAGCA
27251 CAGCAGCGAC TAATGTGGAA ACCACCAGTT CTGGACATGG GTCACAATCC
27301 TCTGTCCTAG CTGACTCAGA AAAAACCAAG GCCACAGCTC CAATGGATAC
27351 CACCTCCACC ATGGGGCATA CAACTGTTTC CACATGAATG TCTGTTTCCT
27401 CTGAGACTAC AAAAATTAAG AGAGAGTCAA CATATTCCTT GACTCCTGGA
27451 CTGAGAGAGA CCAGCATTTC CCAAAATGCC AGCTTTTCCA CTGACACAAG
27501 TATTGTTCTT TCAGAAGTCC CCACTGGTAC TACTGCTGAG GTCTCCAGGA
27551 CAGAAGTCAC CTCCTCTGGT AGAACATCCA TCCCTGGCCC TTCTCAGTCC
27601 ACAGTTTTGC CAGAAATATC CACAAGAACA ATGACAAGGC TCTTTGCCTC
27651 GCCCACCATG ACAGAATCAG CAGAAATGAC CATCCCCACT CAAACAGGTC
27701 CTTCTGGGTC TACCTCACAG GATACCCTTA CCTTGGACAC ATCCACCACA
27751 AAGTCCCAGG CAAAGACTCA TTCAACTTTG ACTCAGAGAT TTCCACACTC
27801 AGAGATGACC ACTCTCATGA GCAGAGGTCC TGGAGATATG TCATGGCAAA
27851 GCTCTCCCTC TCTGGAAAAT CCCAGCTCTC TCCCTTCCCT GCTGTCTTTA



WO 2004/045553



PCT/US2003/037041 



Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



27901 CCTGCCACAA CCTCACCTCC TCCCATTTCC TCCACATTAC CAGTGACTAT
27951 CTCCTCCTCT CCTCTTCCTG TGACTTCACT TCTCACCTCT AGCCCGGTAA
28001 CGACCACAGA CATGTTACAC ACAAGCCCAG AACTTGTAAC CAGTTCACCT
28051 CCAAAGCTGA GCCACACTTC AGATGAGAGA CTGACCACTG GCAAGGACAC
28101 CACAAATACA GAAGCTGTGC ATCCTTCCAC AAACACAGCA GCGTCCAATG
28151 TGGAGATTCC CAGCTCTGGA CATGAATCCC CTTCCTCTGC CTTAGCTGAC
28201 TCAGAGACAT CCAAAGCCAC ATCACCAATG TTTATTACCT CCACCCAGGA
28251 GGATACAACT GTTGCCATAT CAACCCCTCA CTTCTTGGAG ACTAGCAGAA
28301 TTCAGAAAGA GTCAATTTCC TCCCTGAGCC CTAAATTGAG GGAGACAGGC
28351 AGTTCTGTGG AGACAAGCTC AGCCATAGAG ACAAGTGCTG TCCTTTCTGA
28401 AGTGTCCGTT GGTGCTACTA CTGAGATCTC CAGGACAGAA GTCACCTCCT
28451 CTTAGCAGAAC ATCCATCTCT GGTTCTGCTG AGTCCACAAT GTTGCCAGAA
28501 ATATCCACCA CAAGAAAAAT CATTAAGTTC CCTACTTCCC CCATCCTGGC
28551 AGAATCATCA GAAATGACCA TCAAGACCCA AACAAGTCCT CCTGGGTCTA
28601 CATCAGAGAG TACCTTTACA TTAGACACAT CAACCACTCC CTCCTTGGTA
28651 ATAACCCATT CGACTATGAC TCAGAGATTG CCACACTCAG AGATAACCAC
28701 TCTTGTGAGT AGAGGTGCTG GGGATGTGCC ACGGCCCAGC TCTCTCCCTG
28751 TGGAAGAAAC AAGCCCTCCA TCTTCCCAGC TGTCTTTATC TGCCATGATC
28801 TCACCTTCTC CTGTTTCTTC CACATTACCA GCAAGTAGCC ACTCCTCTTC
28851 TGCTTCTGTG ACTTCACTTC TCACACCAGG CCAAGTGAAG ACTACTGAGG

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



28901 TGTTGGACGC AAGTGCAGAA CCTGAAACCA GTTCACCTCC AAGTTTGAGC
28951 AGCACCTCAG TTGAAATACT GGCCACCTCT GAAGTCACCA CAGATACGGA
29001 GAAAATTCAT CCTTTCTCAA ACACGGCAGT AACCAAAGTT GGAACTTCCA
29051 GTTCTGGACA TGAATCCCCT TCCTCTGTCC TACCTGACTC AGAGACAACC
29101 AAAGCCACAT CGGCAATGGG TACCATCTCC ATTATGGGGG ATACAAGTGT
29151 TTCTACATTA ACTCCTGCCT TATCTAACAC TAGGAAAATT CAGTCAGAGC
29201 CAGCTTCCTC ACTGACCACC AGATTGAGGG AGACCAGCAC CTCTGAAGAG
29251 ACCAGCTTAG CCACAGAAGC AAACACTGTT CTTTCTAAAG TGTCCACTGG
29301 TGCTACTACT GAGGTCTCCA GGACAGAAGC CATCTCCTTT AGCAGAACAT
29351 CCATGTCAGG CCCTGAGCAG TCCACAATGT CACAAGACAT CTCCATAGGA
29401 ACCATCCCCA GGATTTCTGC CTCCTCTGTC CTGACAGAAT CTGCAAAAAT
29451 GACCATCACA ACCCAAACAG GTCCTTCGGA GTCTACACTA GAAAGTACCC
29501 TTAATTTGAA CACAGCAACC ACACCCTCTT GGGTGGAAAC CCACTCTATA
29551 GTAATTCAGG GATTTCCACA CCCAGAGATG ACCACTTCCA TGGGCAGAGG
29601 TCCTGGAGGT GTGTCATGGC CTAGCCCTCC CTTTGTGAAA GAAACCAGCC
29651 CTCCATCCTC CCCGCTGTCT TTACCTGCCG TGACCTCACC TCATCCTGTT
29701 TCCACCACAT TCCTAGCACA TATCCCCCCC TCTCCCCTTC CTGTGACTTC
29751 ACTTCTCACC TCTGGCCCGG CGACAACCAC AGATATCTTG GGTACS^GCA
29801 CAGAACCTGG TW^CCAGTTCA TCTTCAAGTT TGAGCACCAC CTCCCATGAG
29851 AGACTGACCA CTTACAAAGA CACTGCACAT ACAGAAGCCG TRTATrnTTr

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



29901 CACAAACACA GGAGGGACCA ATGTGGCAAC CACCAGCTCT GGATATAAAT 

29951 CACAGTCCTC TGTCCTAGCT GACTCATCTC CAATGTGTAC CACCTCCACC 

30001 ATGGGGGATA CAAGTGTTCT CACATCAACT CCTGCCTTCC TTGAGACTAG 

30051 GAGGATTCAG ACAGAGCTAG CTTCCTCCCT GACCCCTGGA TTGAGGGAGT 

30101 CCAGTGGCTC TGAAGGGACC AGCTCAGGCA CCAAGATGAG CACTGTCCTC 

30151 TCTAAAGTGC CCACTGGTGC TACTACTGAG ATCTCCAAGG AAGACGTCAC

30201 CTCCATCCCA GGTCCCGCTC AATCCACAAT ATCACCAGAC ATCTCCACAA 

30251 GAACCGTCAG CTGGTTCTCT ACATCCCCTG TCATGACAGA ATCAGCAGAA 

30301 ATAACCATGA ACACCCATAC AAGTCCTTTA GGGGCCACAA CACAAGGCAC 

30351 CAGTACTTTG GCCACGTCAA GCACAACCTC TTTGACAATG ACACACTCAA 

30401 CTATATCTCA AGGATTTTCA CACTCACAGA TGAGCACTCT TATGAGGAGG 

30451 GGTCCTGAGG ATGTATCATG GATGAGCCCT CCCCTTCTGG AAAAAACTAG 

30501 ACCTTCCTTT TCTCTGATGT CTTCACCAGC CACAACTTCA CCTTCTCCTG 

30551 TTTCCTCCAC ATTACCAGAG AGCATCTCTT CCTCTCCTCT TCCTGTGACT 

30601 TCACTCCTCA CGTCTGGCTT GGCAAAAACT ACAGATATGT TGCACAAAAG 

30651 CTCAGAACCT GTAACCAACT CACCTGCAAA TTTGAGCAGC ACCTCAGTTG 

30701 AAATACTGGC CACCTCTGAA GTCACCACAG ATACAGAGAA AACTCATCCT 

30751 TCTTCAAACA GAACAGTGAC CGATGTGGGG ACCTCCAGTT CTGGACATGA

30801 ATCCACTTCC TTTGTCCTAG CTGACTCACA GACATCCAAA GTCACATCTC 



30851 CAATGGTTAT TACCTCCACC ATGGAGGATA CGAGTGTCTC CACATCAACT

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



30901 CCTGGCTTTT TTGAGACTAG CAGAATTCAG ACAGAACCAA CATCCTCCCT
30951 GACCCTTGGA CTGAGAAAGA CCAGCAGCTC TGAGGGGACC AGCTTAGCCA
31001 CAGAGATGAG CACTGTCCTT TCTGGAGTGC CCACTGGTGC C3VCTGCTGAA
31051 GTCTCCAGGA CAGAAGTCAC CTCCTCTAGC AGAACATCCA TCTCAGGCTT
31101 TGCTCAGCTC ACAGTGTCAC CAGAGACTTC CACAGAAACC ATCACCAGAC
31151 TCCCTACCTC CAGCATAATG ACAGAATCAG CAGAAATGAT GATCAAGACA
31201 CAAACAGATC CTCCTGGGTC TACACCAGAG AGTACTCATA CTGTGGACAT
31251 ATCAACAACA CCCAACTGGG TAGAAACCCA CTCGACTGTG ACTCAGAGAT
31301 TTTCACACTC AGAGATGACC ACTCTTGTGA GCAGAAGCCC TGGTGATATG
31351 TTATGGCCTA GTCAATCCTC TGTGGAAGAA ACCAGCTCTG CCTCTTCCCT
31401 GCTGTCTCTG CCTGCCACGA CCTCACCTTC TCCTGTTTCC TCTACATrAG
31451 TAGAGGATTT CCCTTCCGCT TCTCTTCCTG TGACTTCTCT TCTCACCCCT
31501 GGCCTGGTGA TAACCACAGA CAGGATGGGC ATAAGCAGAG AACCTGGAAC
31551 CAGTTCCACT TCAAATTTGA GCAGCACCTC CCATGAGAGA CTGACCACTT
31601 TGGAAGACAC TGTAGATACA GAAGACATGC AGCCTTCCAC ACACACAGCA
31651 GTGACCAACG TGAGGACCTC CATTTCTGGA CATGAATCAC AATCTTCTGT
31701 CCTATCTGAC TCAGAGACAC CCAAAGCCAC ATCTCCAATG GGTACCACCT
31751 ACACCATGGG GGAAACGAGT GTTTCCATAT CCACTTCTGA CTTCTTTGAG
31801 ACCAGCS^GAA TTCAGATAGA ACCAACATCC TCCCTGACTT CTGGATTGAG
31851 GGAGACCAGC AGCTCTGAGA GGATCAGCTC AGCCACAGAG GGAAnfArrr!

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



31901 TCCTTTCTGA AGTGCCCAGT GGTGCTACCA CTGAGGTCTC CAGGACAGAA
31951 GTGATATCCT CTAGGGGAAC ATCCATGTCA GGGCCTGATC AGTTCACCAT
32001 ATCACCAGAC ATCTCTACTG AAGCGATCAC CAGGCTTTCT ACTTCCCCCA
32051 TTATGACAGA ATCAGCAGAA AGTGCCATCA CTATTGAGAC AGGTTCTCCT
32101 GGGGCTACAT CAGAGGGTAC CCTCACCTTG GACACCTCAA CAACAACCTT
32151 TTGGTCAGGG ACCCACTCAA CTGCATCTCC AGGATTTTCA CACTCAGAGA
32201 TGACCACTCT TATGAGTAGA ACTCCTGGAG ATGTGCCATG GCCGAGCCTT
32251 CCCTCTGTGG AAGAAGCCAG CTCTGTCTCT TCCTCACTGT CTTCACCTGC
32301 CATGACCTCA ACTTCTTTTT TCTCCACATT ACCAGAGAGC ATCTCCTCCT
32351 CTCCTCATCC TGTGACTGCA CTTCTCACCC TTGGCCCAGT GAAGACCACA
32401 GACATGTTGC GCACAAGCTC AGAACCTGAA ACCAGTTCAC CTCCAAATTT
32451 GAGCAGCACC TCAGCTGAAA TATTAGCCAC GTCTGAAGTC ACCAAAGATA
32501 GAGAGAAAAT TCATCCCTCC TCAAACACAC CTGTAGTCAA TGTAGGGACT
32551 GTGATTTATA AACATCTATC CCCTTCCTCT GTTTTGGCTG ACTTAGTGAC
32601 AACAAAACCC ACATCTCCAA TGGCTACCAC CTCCACTCTG GGGAATACAA
32651 GTGTTTCCAC ATCAACTCCT GCCTTCCCAG AAACTATGAT GACACAGCCA
32701 ACTTCCTCCC TGACTTCTGG ATTAAGGGAG ATCAGTACCT CTCAAGAGAC
32751 CAGCTCAGCA ACAGAGAGAA GTGCTTCTCT TTCTGGAATG CCCACTGGTG
32801 CTACTACTAA GGTCTCCAGA ACAGAAGCCC TCTCCTTAGG CAGAACATCC
32851 ACCCCAGGTC CTGCTCAATC CACAATATCA CCAGAAATCT CCACGGAAAC

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



32901 CATCACTAGA ATTTCTACTC CCCTCACCAC GACAGGATCA GCAGAAATGA
32951 CCATCACCCC CAAAACAGGT CATTCTGGGG CATCCTCACA AGGTACCTTT
33001 ACCTTGGACA CATCAAGCAG AGCCTCCTGG CCAGGAACTC ACTCAGCTGC
33051 AACTCACAGA TCTCCACACT CAGGGATGAC CACTCCTATG AGCAGAGGTC
33101 CTGAGGATGT GTCATGGCCA AGCCGCCCAT CAGTGGAAAA AACTAGCCCT
33151 CCATCTTCCC TGGTGTCTTT ATCTGCAGTA ACCTCACCTT CGCCACTTTA
33201 TTCCACACCA TCTGAGAGTA GCCACTCATC TCCTCTCCGG GTGACTTCTC
33251 TTTTCACCCC TGTCATGATG AAGACCACAG ACATGTTGGA CACAAGCTTG
33301 GAACCTGTGA CCACTTCACC TCCCAGTATG AATATCACCT CAGATGAGAG
33351 TCTGGCCACT TCTAAAGCCA CCATGGAGAC AGAGGCAATT CAGCTTTCAG



[0056] 33401 AAAACACAGC TGTGACTCAG ATGGGCACCA TCAGCGCTAG ACAAGAATTC

33451 TATTCCTCTT ATCCAGGCCT CCCAGAGCCA TCCAAAGTGA CATCTCCAGT 



33501 GGTCACCTCT TCCACCATAA AAGACATTGT TTCTACAACC ATACCTGCTT
33551 CCTCTGAGAT AACAAGAATT GAGATGGAGT CAACATCCAC CCTGACCCCC
33601 ACACCAAGGG AGACCAGCAC CTCCCAGGAG ATCCACTCAG CCACAAAGCC
33651 AAGCACTGTT CCTTACAAGG CACTCACTAG TGCCACGATT GAGGACTCCA
33701 TGACACAAGT CATGTCCTCT AGCAGAGGAC CTAGCCCTGA TCAGTCCACA
33751 ATGTCACAAG ACATATCCAG TGAAGTGATC ACCAGGCTCT CTACCTCCCC
33801 CATCAAGGCA GAATCTACAG AAATGACCAT TACCACCCAA ACAGGTTCTC
33851 CTGGGGCTAC ATCAAGGGGT ACCCTTACCT TGGACACTTC AACAACTTTT

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



33901 ATGTCAGGGA CCCACTCAAC TGCATCTCAA GGATTTTCAC ACTCACAGAT
33951 GACCGCTCTT ATGAGTAGAA CTCCTGGAGA TGTGCCATGG CTAAGCCATC
34001 CCTCTGTGGA AGAAGCCAGC TCTGCCTCTT TCTCACTGTC TTCACCTGTC
34051 ATGACCTCAT CTTCTCCCGT TTCTTCCACA TTACCAGACA GCATCCACTC
34101 TTCTTCGCTT CCTGTGACAT CACTTCTCAC CTCAGGGCTG GTGAAGACCA
34151 CAGAGCTGTT GGGCACAAGC TCAGAACCTG AAACCAGTTC ACCCCCAAAT
34201 TTGAGCAGCA CCTCAGCTGA AATACTGGCC ACCACTGAAG TCACTACAGA
34251 TACAGAGAAA CTGGAGATGA CCAATGTGGT AACCTCAGGT TATACACATG
34301 AATCTCCTTC CTCTGTCCTA GCTGACTCAG TGACAACAAA GGCCACATCT
34351 TCAATGGGTA TCACCTACCC CACAGGAGAT ACAAATGTTC TCACATCAAC
34401 CCCTGCCTTC TCTGACACCN N2JNNNNNN1JN NNNNJ3NNIJNN NNNNWNNNNN
34451 NNNNNNNNNN NNNNNIilNNNN Nl ^"^mmiN NMNNNNNMNN
34501 NmmmmN nnnnnnnnnn nnnnnknnnn nnnncggaaa ccaagtttct
Exon 4
34551 AACCAACCCC TCCTTTTTGA CCC CAGTAGG ATTCAAACAA AGTCAAAGCT
34601 CTCACTGACT CCTGGGTTGA TGGAGACCAG CATCTCTGAA GAGACCAGCT
34651 CTGCCACAGA AAAAAGCACT GTCCTTTCTA GTGTGCCCAC TGGTGCTACT
34701 ACTGAGGTCT CCAGGACAGA AGCCATCTCT TCTAGCAGAA CATCCATCCC
34751 AGGCCCTGCT CAATCCACAA TGTCATCAGA CACCTCCATG GAAACCATCA
34801 CTAGAATTTC TACCCCCCTC ACAAGGAAAG AATCAACAGA CATGGCCATC

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



34851 ACCCCCAAAA CAGGTCCTTC TGGGGCTACC TCGCAGGGTA CCTTTACCTT
34901 GGACTCATCA AGCACAGCCT CCTGGCCAGG AACTCACTCA GCTACAACTC
34951 AGAC3ATTTCC ACAGTCAGTG GTGACAACTC CTATGAGCAG AGGTCCTGAG
35001 GATGTGTCAT GGCCAAGCCC GCTGTCTGTG GAAAAAAACA GCCCTCCATC
35051 TTCCCTGGTA TCTTCATCTT CAGTAACCTC ACCTTCGCCA CTTTATTCCA
35101 CACCATCTGG GAGTAGCCAC TCCTCTCCTG TCCCTGTCAC TTCTCTTTTC
35151 ACCTCTATCA TGATGAAGGC CACAGACATG TTGGATGCAA GTTTGGAACC
35201 TGAGACCACT TCAGCTCCCA ATATGAATAT CACCTCAGAT GAGAGTCTGG
35251 CCACTTCTAA AGCCACCACG GAGACAGAGG CAATTCACGT TTTTGAAAAT
35301 ACAGCAGCGT CCCATGTGGA AACCACCAGT GCTACAGAGG AACTCTATTC
35351 CTCTTCCCCA GGCTTCTCAG AGCCAACAAA AGTGATATCT CCAGTGGTCA
35401 CCTCTTCCTC TATAAGAGAC AACATGGTTT CCACAACAAT GCCTGGCTCC
35451 TCTGGCATTA CAAGGATTGA GATAGAGTCA ATGTCATCTC TGACCCCTGG
35501 ACTGAGGGAG ACCAGAACCT CCCAGGACAT CACCTCATCC ACAGAGACAA
35551 GCACTGTCCT TTACAAGATG TCCTCTGGTG CCACTCCTGA GGTCTCCAGG
35601 ACAGAAGTTA TGCCCTCTAG CAGAACATCC ATTCCTGGCC CTGCTCAGTC
35651 CAGAATGTCA CTAGACATCT CCGATGAAGT TGTCACCAGG CTGTCTACCT
35701 CTCCCATCAT GACAGAATCT GCAGAAATAA CCATCACCAC CCAAACAGGT
35751 TATTCTCTGG CTACATCCCA GGTTACCCTT CCCTTGGGCA CCTCAATGAC
35801 CTTTTTGTCA GGGACCCACT CAACTATGTC TCAAGGACTT TCACACTCAG

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



35851 AGATGACCAA TCTTATGAGC AGGGGTCCTG AAAGTCTGTC ATGGACGAGC
35901 CCTCGCTTTG TGGAAACAAC TAGATCTTCC TCTTCTCTGA CATCATTACC
35951 TCTCACGACC TCACTTTCTC CTGTGTCCTC CACATTACTA GACAGTAGCC
36001 CCTCCTCTCC TCTTCCTGTG ACTTCACTTA TCCTCCCAGG CCTGGTGAAG
36051 ACTACAGAAG TGTTGGATAC AAGCTCAGAG CCTAAAACCA GTTCATCTCC
36101 AAATTTGAGC AGCACCTCAG TTGAAATACC GGCCACCTCT GAAATCATGA
36151 CAGATACAGA GAAAATTCAT CCTTCCTCAA ACACAGCGGT GGCCAAAGTG
36201 AGGACCTCCA GTTCTGTTCA TGAATCTCAT TCCTCTGTCC TAGCTGACTC
36251 AGAAACAACC ATAACCATAC CTTCAATGGG TATCACCTCC GCTGTGGACG
36301 ATACCACTGT TTTCACATCA AATCCTGCCT TCTCTGAGAC TAGGAGGATT
36351 CCGACAGAGC CAACATTCTC ATTGACTCCT GGATTCAGGG AGACTAGCAC
36401 OTCTGAAQAG ACCACCTCAA TCACAGAAAC AAGTGCAGTC CTTTATGGAG
36451 TGCCCACTAG TGCTACTACT GAAGTCTCCA TGACAGAAAT CATGTCCTCT
36501 AATAGAACAC ACATCCCTGA CTCTGATCAG TCCACGATGT CTCCAGACAT
36551 CATCACTGAA GTGATCACCA GGCTCTCTTC CTCATCCATG ATGTCAGAAT
36601 CAACACAAAT GACCATCACC ACCCAAAAAA GTTCTCCTGG GGCTACAGCA
36651 CAGAGTACTC TTACCTTGGC CACAACAACA GCCCCCTTGG CAAGGACCCA
36701 CTCAACTGTT CCTCCTAGAT TTTTACACTC AGAGATGACA ACTCTTATGA
36751 GTAGGAGTCC TGAAAATCCA TCATGGAAGA GCTCTCCCTT TGTGGAAAAA
36801 ACTAGCTCTT CATCTTCTCT GTTGTCCTTA CCTGTCACGA CCTCACCTTC

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



36851 


TGTTTPTTPP ttr*aT'T'7vr«r«r»ri 7inn/^mnm«/n/^ mm^^'^^^^^^ 

xvjyx ALAiiACLGC AGAGTATCCC TTCCTCCTCT TTTTCTGTGA 


36901 


v,4. AWiciuci UAC-CCCAGGC ATGGTGAAGA CTACAGACAC AAGCACAGAA 


36951 


v.v.ivjvji^\«uft iji A lAiV-iCC AAATCTGAGT GGCACCTCAG TTGAAATACT 


37001 


vj^^-AVj^-^-iui ijAA^aiLAi-CA CAGATACAGA GAAAATTCAT CCTTCTTCAA 


37051 


UAUUfyVTGTG GGAACCACCA GTTCTGGACA TGAACTATAT 


37101 


iui^iuiiaiii CAATCCACTC GGAGCCATCC AAGGCTACAT ACCCAGTGGG 


37151 


lA^iuuciuj. iLUArGGCTG AAACCTCTAT TTCCACATCA ATGCCTGCTA 


37201 


X J. i\aAVjAC CALAGGATTT GAGGCTGAGC CATTTTCTCA TTTQACTTCT 


37251 


vTvsAx X X AVjkaA A.l3ALJ\AALAT GTCCCTGGAC ACCAGCTCAG TCACACCAAC 


37301 


AAAfAPAPPT" 'Vr*'Wr*'vr*r*'vr* ry^rnrtrtj^rvrtr^n r^y^mmmr**^ ^ . 

rtftrti/i\^uux xciii^iccic? (jGTCCACTCA ccttttacag agttccaaga 


37351 


L.xij:axxxcAC ciCrrCTGCA AAAACATCAT CCCCAGACTG GCCTCCAGCC 


37401 


J. wnwav7j.^x A v«xt.xn^iMx xuu AVvXCarviAvJAxA ATCACCCCCT TTAATGPTTP 


37451 TCCATCTATT ACGGAGTCCA CTGGGATAAC CTCCTTCCCA GAATCCAGGT
37501 TTACTATGTC TGTAACAGAA AGTACTCATC ATCTGAGTAC AGATTTGCTG
37551 CCTTCAGCTG AGACTATTTC CACTGGCACA GTGATGCCTT CTCTATCAGA
37601 GGCCATGACT TCATTTGCCA CCACTGGAGT TCCACGAGCC ATCTCAGGTT
37651 CAGGTAGTCC ATTCTCTAGG ACAGAGTCAG GCCCTGGGGA TGCTACTCTG
37701 TCCACCATTG CAGAGAGCCT GCCTTCATCC ACTCCTGTGC CATTCTCCTC
37751 TTCAACCTTC ACTACCACTG ATTCTTCAAC CATCCCAGCC CTCCATGAGA
37801 TAACTTCCTC TTCAGCTACC CCATATAGAG TGGACACCAG TCTTGGGArft

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



37851 GAGAGCAGCA CTACTGAAGG ACGCTTGGTT ATGGTCAGTA CTTTGGACAC 
37901 TTCAAGCCAA CCAGGCAGGA CATCTTCAAC ACCCATTTTG GATACCAGAA 
37951 TGACAGAGAG CGTTGAGCTG GGAACAGTGA CAAGTGCTTA TCAAGTTCCT 



38001 TCACTCTCAA CACGGTTGAC AAGAGAATGC GCATGGCGAG AAGGGAGAAG
38051 TGTAGTTGGA TGGATAAAAG GAAGAATGGA GAGAAGAGTG AATGGAAGGA
38101 AGCAAAGATG AAGCGGAGGA AGGATAGATG CACAGAAGGA AGGATGAAAA
38151 GAAAGAAAGA TGATGGAAGA CAGGATTGAA GGGGATATAG ATTGAAGGAA
38201 AGAAAGGTAG AAGGATGAAA TGAAGTAAAG ATTGAAGAAA AGATGGATGG
38251 AAAGAAGAAA GGAGGGTGCA CAAAAAATCT CACACTTCAC CACATATGAT
38301 TCATCCATAT AAGAAAAAAC CACTTGTACC CTCAAAGCTA TTGAAATACA
38351 AACTTTTAAA TTAAAATTTT AAAAAGCAAG AGAAAGGAAA GAAGGGAGGA
38401 AAGACAAAAG GAAGAATGGG TGATAGAAGG AAAGAATAAA AGGAAGAAAA
38451 AATGGAAGAA TAGATGATCA GATCTAGGGA TGAATGAAAG GAAGGATGGA
38501 CAAATCTATA GGTAGGTGGA TGGATCTATG GACAGGTGTG GCCACTTATG
38551 GCACATAGTC CCAGCTCCAG TTCATACTGA TGGACTTGAG GAGTGTTTGT
38601 GGCCAATGAA GTGGATCCAT TTAGACAGTG CTCTTCTTCT GAATGAGATG
Exon 5
38651 AGTTACCCCA GTTTTTCTCC CCACCTTCAT CTTCAGGAAC TGATGGCATT
38701 ATGGAACACA TCACAAAAAT ACCCAATGAA GCAGCACACA GAGGTACCAT
38751 AAGACCAGTC AAAGGCCCTC AGACATCCAC TTCGCCTGCC AGTCCTAAAG

Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



38801 GTAGGTTTAA CTTTGCTTAC CTCCCAGTAA TGCCACTCGT GACCATATTT
38851 CCTCCTCCAG AGAGACAAAA TGTTTGTATT CTTTAGAGAG AGAATTGTGT
38901 GTGGTTGTCA TAGGTTTCCC TGTCTGAACT GAGTCTTTAT CTAATGGTTA
38951 CCAGGCAGAT GTTACCACTG TCTCTTTCTC CTCATGGCAT GCTGAGTGAG
39001 TTTTGTCCAA CATCAAATAT TCACAAATTT GTCCATATTA ACCAAATTTT
39051 AAAAATGCTC ATTAAAAACT TACTATGAGC TGGGCGCAGT GGCTCATGCC
39101 TGTAATCCCA ATACTTTGGG AGGCTGAGCT GGGTGGATCA CCAGAGGTCA
39151 AAAATTCGAG ACCAGTCTGA CCAAAATGGT GAAACTCCAT CTCTACTGAA
39201 AATATAAAAA TTAGCCGGGC ATGGTGGCAC ACACCGTAAT CACAGCTACT
39251 CAGGAGGCTG AGGCAAGAGA GTCACTTGAA CCACAGGAGG TAGAGGCTGC
39301 AGTGAGCTGA GCATTGTGCC AATGCACTCC AGCCTGGGTG GCAGAGCAAG
39351 ACTCCAGCTC AGAAATAAAT AATATATTAT ATATATATAT ATATGTTTTA
39401 TTTAGATGGA ATATACTATA TATATATGTA TATATATATG TATGTATATA
39451 TATATATGTA TGTATATATA TATATATATA TATATATATA TATATAGAGA
39501 GAGAGAGAGA GAGAGAGAGA GAGAGAGACA GAGTATGTCT GAGAATGCAT
39551 CCCGATAGTT CTAGCAAGGT AGGAAAAGGA AGTATCATAA CAGCCTTGAA
39601 GTAGCCTGTT GAAACAGACA GACTCTCTTG TAAGAGAACT CACA7VAATCT
39651 AGGATTATAT CTCCCATGAT GAAAAATTTG GAACTGTACA TTTTTGTTTA
39701 ACTGTCACTT AAATNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
39751 NNNNNNNNNN NNNNNNNNNN NCCAGGAGGC ACTGTGCTTG GCGCCTTTTT
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



39801 ACCAACACTT TGAGATGGCC ATTGTACTTA TCCCCACTTT ATAGACGGGA
39851 AAATGGAGGT CCAGCAATAT TTTTTAACTT AAAGAGCCAC CCATCTCTTT
39901 AGAGA/^GAG CCAGAATCCC AGGCAGGGGC TATCTTATTC CAGAGCCCAA
39951 GCTCTCAAAC ACATGATACA CAATACTTAA TCTCTCTCAA GTCAGAGGAG
40001 ATCCACTTAA GTATACATCC ATCCACATAT TCATTCATTC AATCATTCAA
40051 CAAATATTAG TTGAGCACTT ACCGTATGCC AAACAGTCAA ACGTGAATAG
40101 CTGTTACAAA TGAGACTGTG AAGGATGGTA CAACGCAGAT TCAGACAGTG
40151 TGATAAGGAA ATATTGAGAA GCAAAGATGA GTTCTGGAGT GAATTTGTAA
40201 AGGTGGATGT GGGCTTGGAT TTCAATAATG GCAGAACTTA AGGAATCTGA
40251 TGAGAAGTGG GCACTTCAGG CAGAGAGAAG AGCTTGAACA AGGCTCAGAG
40301 GCTGACAGTG CAGGAAACAC ATGGGAAGAG GGAATAGAGT AGCGGTCAAG
40351 AATTCACAGA GGAGTTATAG GTGAAGATGC AACCAAGTTA CAGACCAAGG
40401 TAAGATAGGG GAATACCAAT CACAATCTCT TTTCCCATTC CAGAAGCATC
40451 CCAGACACAT CCTAGTAACC GAGAGACATT TCTCTCCCTT TCCTCCTGTG
40501 GAGAATAAAT AAGCTATTGC AAGTCCAGTA AGTGTAATCA TTTTGTTCAA
Exon 6
40551 ATTGTGTGCC CATTCCCCAA TTTACA GGAC TACACACAGG AGGGACAAAA
40601 AGAATGGAGA CCACAACCAC AGCTCTGAAG ACCACCACCA CAGCTCTGAA
40651 GACCACTTCC AGAGCCACCT TGACCACCAG TGTCTATACT CCCACTTTGG
40701 GAACACTGAC TCCCCTCAAT GCATCAATGC AAATGGCCAG CACAATCCCC
Table 1 (continued)



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



40751 


A.CA.GAAATf5A TnATT^ZiPZVar' PT'/^TVTTV'P/^Tirn nnfn/^/^rwn/-i«m^ mmMM-«M«««M 

J. utf^ x\3i\x\^\.j\i\\^ v-uw\lAivjj.T TTCCCTGATG TTCCAGAAAC 


40801 


v3nv»n,x^\«j,uA J. ivjv^dAULA ouCaCaGGAGC AGAAACCAGC ACAGCTCTTC 


40851 


i-uA\^A<wArt.L. UL^UAiurGTT TTCAATAGAG AATCAGAGAC CACAGCCTCA 


40901 


^-x^avjxi-iUiv^ vjiiLTGGGGC AGAGAGAAGT CCGGTTATTC AAACTCTAGA 


40951 


Av3fj.\aA.urC*UAG ATACAACAGC TTCATGGGTT ATCCATCCTG 


41001 


wnoAvaMv.v^Ax v-i-uftAi-Uji i ALCAA.GACAA CCCCCAATTT TTTCCACAGT 


41051 


vji^i X cxvjXAlCiTC CACAGCCACC AGTCATGGGG CAGACGTCAG 


41101 


v-x(-AJaV-L^xx i-V-AACAAATA TCTCACCTAG TGAACTAGAT GCACTGACCC 


41151 


<^i^XLiV7XCA(^ iAlXTCGGGG ACAGATACtA GTACAACATT CCCAACACTG 


41201 


A<-xAAV5Xi.,i..u uAuAlQiAAAC AGAGACAAGA ACCACATGGC TCACTCATCC 


41251 


xurwiijALxAt^u AtjUiLAACTA TTCCCAGAAC AATCCCCAAT TTTTCTCATC 


41301 


ATGAATC!ARA TYlT'PAPliPr'T Tr«7VA»P3vr»r«r«A r^r^iv/^mi^rwi*-!*^ %i .^.^ 
Axvxnnj. wivan x\7i.,uftuAU^^ X xuAAlAGCuA CCAGTCCTGG GGCAGAAACC 


41351 agttcagcta ttccaattat gactgtctca cctggtgcag aagatctggt
41401 GACCTCACAG GTCACTAGTT CTGGGACAGA CAGAAATATG ACTATTCCAA
41451 CTTTGACTCT TTCTCCTGGT GAACCAAAGA CGATAGCCTC ATTAGTCACC
41501 CATCCTGAAG CACAGACAAG TTCGGCCATT CCAACTTCAA CTATCTCGCC
41551 tgctgtatca cggttggtga cctcaatggt caccagtttg gcggcaaaga
41601 CAAGTACAAC TAATCGAGCT CTGACAAACT CCCCTGGTGA ACCAGCTACA
41651 ACAGTTTCAT TGGTCACGCA TCCTGCACAG ACCAGCCCAA CAGTTCCCTG
41701 GACAACTTCC ATTTTTTTCC ATAGTAAATC AGACACCACA CCTTPAATfiA
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



41751 CCACCAGTCA TGGGGCAGAA TCCAGTTCAG CTGTTCCAAC TCCAACTGTT
41801 TCAACTGAGG TACCAGGAGT AGTGACCCCT TTGGTCACCA GTTCTAGGGC
41851 AGTGATCAGT ACAACTATTC CAATTCTGAC TCTTTCTCCT GGTGAACCAG
41901 AGACCACACC TTCAATGGCC ACCAGTCATG GGGAAGAAGC CAGTTCTGCT
41951 ATTCCAACTC CAACTGTTTC ACCTGGGGTA CCAGGAGTGG TGACCTCTCT
42001 GGTCACTAGT TCTAGGGCAG TGACTAGTAC AACTATTCCA ATTCTGACTT
42051 TTTCTCTTGG TGAACCAGAG ACCACACCTT CAATGGCCAC CAGTCATGGG
42101 ACAGAAGCTG GCTCAGCTGT TCCAACTGTT TTACCTGAGG TACCAGGAAT
42151 GGTGACCTCT CTGGTTGCTA GTTCTAGGGC AGTAACCAGT ACAACTCTTC
42201 CAACTCTGAC TCTTTCTCCT GGTGAACCAG AGACCACACC TTCAATGGCC
42251 ACCAGTCATG GGGCAGAAGC CAGCTCAACT GTTCCAACTG TTTCACCTGA
42301 GGTACCAGGA GTGGTGACCT CTCTGGTCAC TA 'TTCTACr GGAGTAAACA
42351 GTACAAGTAT TCCAACTCTG ATTCTTTCTC CTGGTGAACT AGAAACCACA
42401 CCTTCAATGG CCACCAGTCA TGGGGCAGAA GCCAGCTCAG CTGTTCCAAC
42451 TCCAACTGTT TCACCTGGGG TATCAGGAGT GGTGACCCCT CTGGTCACTA
42501 GTTCCAGGGC AGTGACCAGT ACAACTATTC CAATTCTAAC TCTTTCTTCT
42551 AGTGAGCCAG AGACCACACC TTCAATGGCC ACCAGTCATG GGGTAGAAGC
42601 CAGCTCAGCT GTTCTAACTG TTTCACCTGA GGTACCAGGA ATGGTGACCT
42651 CTCTGGTCAC TAGTTCTAGA GCAGTAACCA ^TACAACTAT TCCAACTCTG
42701 ACTATTTCTT CTGATGAACC AGAGACCACA ACTTCATTGG TCACCCATTC
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



42751 TGAGGCAAAG ATGATTTCAG CCATTCCAAC TTTAGCTGTC TCCCCTACTG
42801 TACAAGGGCT GGTGACTTCA CTGGTCACTA GTTCTGGGTC AGAGACCAGT
42851 GCGTTTTCAA ATCTAACTGT TGCCTCAAGT CAACCAGAGA CCATAGACTC
42901 ATGGGTCGCT CATCCTGGGA CAGAAGCAAG TTCTGTTGTT CCAACTTTGA
42951 CTGTCTCCAC TGGTGAGCCG TTTACAAATA TCTCATTGGT CACCCATCCT
43001 GCAGAGAGTA GCTCAACTCT TCCCAGGACA ACCTCAAGGT TTTCCCACAG
43051 TGAATTAGAC ACTATGCCTT CTACAGTCAC CAGTCCTGAG GCAGAATCCA
43101 GCTCAGCCAT TTCAACAACT ATTTCACCTG GTATACCAGG TGTGCTGACA
43151 TCACTGGTCA CTAGCTCTGG GAGAGACATC AGTGCAACTT TTCCAACAGT
43201 GCCTGAGTCC CCACATGAAT CAGAGGCAAC AGCCTCATGG GTTACTCATC
43251 CTGCAGTCAC CAGCACAACA GTTCCCAGGA CAACCCCTAA TTATTCTCAT
43301 AGTGAACCAG ACACCACACC ATCAATAGCC ACCAGTCCTG GGGCAGAAGC
43351 CACTTCAGAT TTTCCAACAA TAACTGTCTC ACCTGATGTA CCAGATATGG
43401 TAACCTCACA GGTCACTAGT TCTGGGACAG ACACCAGTAT AACTATTCCA
43451 ACTCTGACTC TTTCTTCTGG TGAGCCAGAG ACCACAACCT GATTTATCAC
43501 CTATTCTGAG ACACACACAA GTTCAGCCAT TCCAACTCTC CCTGTCTCCC
43551 CTGGTGCATC AAAGATGCTG ACCTCACTGG TCATCAGTTC TGGGACAGAC
43601 AGCACTACAA CTTTCCCAAC ACTGACGGAG ACCCCATATG AACCAGAGAC
43651 AACAGCCATA CAGCTCATTC ATCCTGCAGA GACCAACACA ATGGTTCCCA
43701 GGACAACTCC CAAGTXTTCC CATAGTAAGT CAGACACCAC ACTCCCAGTA
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



43751 GCCATCACCA GTCCTGGGCC AGAAGCCAGT TCAGCTGTTT CAACGACAAC
43801 TATCTCACCT GATATGTCAG ATCTGGTGAC CTCACTGGTC CCTAGTTCTG
43851 GGACAGACAC CAGTACAACC TTCCCAACAT TGAGTGAGAC CCCATATGAA
43901 CCAGAGACTA CAGCCACGTG GCTCACTCAT CCTGCAGAAA CCAGCACAAC
43951 GGTTTCTGGG ACAATTCCCA ACTTTTCCCA TAGGGGATCA GACACTGCAC
44001 CCTCAATGGT CACCAGTCCT GGAGTAGACA CGAGGTCAGG TGTTCCAACT
44051 ACAACCATCC CACCCAGTAT ACCAGGGGTA GTGACCTCAC AGGTCACTAG
44101 TTCTGCAACA GACACTAGTA CAGCTATTCC AACTTTGACT CCTTCTCCTG
44151 GTGAACCAGA GACCACAGCC TCATCAGCTA CCCATCCTGG GACACAGAGT
44201 GGCTTCACTG TTCCAATTCG GACTGTTCCC TCTAGTGAGC CAGATACAAT
44251 GGCTTCCTGG GTCACTCATC CTCCACAGAC CAGCACACCT GTTTCCAGAA
44301 CAACCTCCAG TTTTTCCCAT AGTAGTCCAG ATGCCACACC TGTAATGGCC
44351 ACCAGTCCTA GGACAGAAGC CAGTTCAGCT GTACTGACAA CAATCTCACC
44401 TGGTGCACCA GAGATGGTGA CTTCACAGAT CACTAGTTCT GGGGCAGCAA
44451 CCAGTACAAC TGTTCCAACT TTGACTCATT CTCCTGGTAT GCCAGAGACC
44501 ACAGCCTTAT TGAGCACCCA TCCCAGAACA GAGACAAGTA AAACATTTCC
44551 TGCTTCAACT GTGTTTCCTC AAGTATCAGA GACCACAGCC TCACTCACCA
44601 TTAGACCTGG TGCAGAGACT AGCACAGCTC TCCCAACTCA GACAACATCC
44651 TCTCTCTTCA CCCTACTTGT AACTGGAACC AGCAGAGTTG ATCTAAGTCC
44701 AACTGCTTCA CCTGGTGTTT CTGCAAAAAC AGCCCCACTT TCCACCCATC
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



44751 CAGGGACAGA AACCAGCACA ATGATTCCAA CTTCAACTCT TTCCCTTGGT
44801 TTACTAGAGA CTACAGGCTT ACTGGCCACC AGCTCTTCAG CAGAGACCAG
44851 CACGAGTACT CT/U^CTCTGA CTGTTTCCCC TGCTGTCTCT GGGCTTTCCA
44901 GTGCCTCTA T AACAACTGAT AAGCCCCAAA CTGTGACCTC CTGGAACACA
44951 GAAACCTCAC CATCTGTAAC TTCAGTTGGA CCCCCAGAAT TTTCCAGGAC
45001 TGTCACAGGC ACCACTATGA CCTTGATACC ATCAGAGATG CCAACACCAC
45051 CTAAAACCAG TCATGGAGAA GGAGTGAGTC CAACCACTAT CTTGAGAACT
45101 ACAATGGTTG AAGCCACTAA TTTAGCTACC ACAGGTTCCA GTCCCACTGT
45151 GGCCAAGACA ACAACCACCT TCAATACACT GGCTGGAAGC CTCTTTACTC
45201 CTCTGACCAC ACCTGGGATG TCCACCTTGG CCTCTGAGAG TGTGACCTCA
45251 AGAACAAG TA AGAATAACTT TTTTATTGTG GTAAAATATA AATACTATAA
45301 AAATTGCCAT TCTAAACATT TTAATTGTAC AACTCAGCAG TACTAATACA
45351 TTCACATTGT TGTGCAACCC TCACCACTAT CTGTTTTCAA AACTTTTTTT
45401 ATCACCCCAA ACAGGACTGA AGGAATAATT TCCCATTCCC CATTCTCCCT
45451 AGTGCAGTGG TGCAATCTCG GCTCACCACA ACCTCTGAAC CTCTGTCTCC
45501 TGGGTTCAAG CAATTCTCCT GCATCAGCCT CCTGAGTAGT TGGGACTACA
45551 GGTGCACGCC ACCGTGCCTG GCTAATTTTT GTATTTTTAG TACAGACAGG
45601 GTTTTACCAT GTTGGTCAGG CTGGTCTCAA ACTCCTGACC TCAGGTGGTC
45651 CACACGCCTT GGCCTCCC2\A AGTGCTGGGA TTACAAGTGT GAGACACTGT
45701 GCCCGGCCAT ATCTGTTAGA TCTTACTAAT CCTGTCAAGA GGATTCAGTG






WO 2004/045553



PCT/US2003/037041 



Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 

45751 TCCTTTTTTT TTTTTCTTTC TTTTTTTTGA TAGAGTCTCC CTCTGGCACC 

45801 CAGGCTGGAG TGCAGTGGTA CGGTCTTGGC TCACTGCAGC CTCCACCTCC 

45851 C7W3ACTGAAG CGATTCTCCT GCCTCAGCCT CCCGAATAGC TGGGACTACA 

45901 GGCGCGTGCC ACCACGCCCA GCTAATTTTT GCATTTTTAG TAGAGATGGG 

45951 ATTTCACTAT GTTGGCCAGG CTGGTCTCAA ACTCCTGATC TCAAGTGATC 

46001 CGCCCAAGGG CCTCCCAAAG TACTGGGATT ACAGGTAGGA GCCACCTCAC 

46051 CTGGCCCTAT TTTCGGAATG GATTTTTTTT TAATGTTTAA AATGTCACCT 

46101 AAGATTATTG TGAAGATCAA ATAAGATAAA ATCCTAATAA CCCAAGTAAA 

46151 CCACAGGGCT CCACTTGGAC CAGTCTCAGA AGTTTCAAGA AAATCAGTCA 

46201 GACCATCAAA TGTAAAATAA GTCTAAATTT TCTTTGCACT ATTCACAGAG 

46251 TGCCAAAGAG GATCTAATTC ATGTTTCAGA ACATACCCTA CTTACTAAAA 

46301 TCCCCTTTTC CTCATTTCTT CTCATTCTGC AACTTTATCA TCTCCTGCGG 

46351 ACCCCCTAGC CTCTCCCCTC CCCATAGTCA GTCTCTCTCT CTCTCTTTCC 

46401 CTCCCCTCTT ATTATCTCAA TTTCACACGA AAGAATTCCA GAAACTATAC 

46451 TGCCAAAAGT CTTTCCTGTC TTTGAAAAGT TGGGAAAGAG GAGAAACTCA 

46501 GACAGCAATG ACAAAATTAT ACGTAATGGA TGAAGGAAAC ACAAATAAGG 

46551 CTGGAAACAG T^AAATTTTGT CCCCATCATT TATTTAATGA AGGTGGCAGT 

46601 ATTCCAGCCA CATAGTGA^VC CCCCACAATA AGAAGGGGCC TCTGGCGATT 

46651 GATTATTGTC ATTGTTGTTA ATGATAATGA GGGTGAGGAT ATCATGAGCA 

46701 TCAGTGTAGG AGGCAGTTAA CTAATAAGAC CAAGCTGTTG GCTGGGCGTG 
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1) 



46751 CTGGTTCACA CCTGCAGTCC CAGCACTTTG GGAGGCCAAA GTGGGTGGAT
46801 CACTTGAGGT CAGGAGTTCA AGACTAGCCT GGCCAACATG GTGAAACCTG
46851 GTCTCTACCA AAAATACAAA AATTAGTCAG GTGTGGTGGC GTGTGCCTGT
46901 AATGACAACT ACTTGGGAGG CTGAGGCAGG AGAATCACTT GAACCTGGGA
46951 GGCGGAGGCT GCAGTGAGAT GAGCTTGAAC CACTGCACTC CAGCCCGGGC
47001 AACAGAGAGA GACTCTTGTC TCAAAAAACA AAACAAACAA ACAAAAACTA
47051 AACCAT^CAA AAA7\AQACTA GCTGTTATTC ATTTATTTAT TTATTTATTT
47101 AGAGACGGAG TCTCGCTCTG TCACCCAGGC TGGAGTGCAG CGGCACAATC
47151 TTGGCTCACT GCAACCTCTG CCTCCCAGGT TCATGTGATT CTCCCGCCTC
47201 AGCCTCCCCA GCTGTTGTTA TTCATGAATG AACCTCAGAG AAAGCACACA
47251 GGAGGGTTGG TGCACCTGTG TTTTGAGTTC TACCCCTCCT TCCTCTCTTA
47301 ACTTCCTCCT GTCTTCTCAC TCTGATTCGT TCTTCCTTCC TCTCCCTCTC
Exon 7
47351 TCTCTGCA GG TTATAACCAT CGGtCCTGGA TCTCCACCAC CAGCAG TGAG
47401 TAAACATGGC CCTGAAGTCC CTATGCCCTG GGAATTCTTC CTCCCTAAGC
47451 CTGCCTTCCA GGAGGAAAGT ATCCCCCATT CCCTAGGTTC TCATCCCCAC
47501 AGAAACTCCA GAATAGCAAA AGTCTCAGGC TGAGCCAAGG CACAGATGCC
47551 AGTGCTCACC AAGAGTCCTA TTCTCCCCTC GCTAA^TGAT AGGACCCAAC
47601 AAACCCGATT CACGCTGCGT TTTCTTTCAG CTCCGATGAC CTCCATGTTC
47651 TCTCCAAGGC CTCTCGTATC TGTGAGCCCC ACCCCCAGCG CTACAGGTAG
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Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 

47701 GAATCTGGCT TCCAGCTCCC ATGAAACGTC GGCTGCCATT CAGTGGCTGA 

47751 TTAATTGCTG TGTGGTCTGA GTCCTGATGC CCACCAAGTC TCAGCGTGTT 

47801 CCCCTCTGTC CAATCTCATC CAACAATTTA AGCTAATGCT TGTTTAATGA 

47851 TGTCCTCACT ATACCACCTT GGACACTTTC TTTTTGCCTG GATTTAAAGC 

47901 TTCCATTTCT TTCCTTCCTT CCTTCTTTTC TTCCTTCCTT CCTTCCTTCC 

47951 TTCCTTCCTT CCTTCCTTCC TTCCTTCCTT CCTTCCTTCC TCCTTCCTTC 

48001 CTTCCTTTCT TCCTTTCTTC CTGTCTTTTT CTTTCTTTCC TTCTTTTGGC 

48051 AGAGTCTCAC TCTGTCGCCC AGGCTGGAGT GCAATGGTGC AATCTCGGTT 

48101 CACTGCAACC TCTGCCTCCC AGGTTCAAGC GATTCTCATG CCACATGCCA 

48151 CTATGCCTGG CTAATTTTTG TTTTTTTGTT TTTTGGGGGG TTTTTTGAGA 

48201 CAGAGTCTCA GTCTGTTGCC CAAGCTGGAG TGCAGTGGCA TGATCTCGGG 

48251 TCACTGCAAC CTCCTTCTCC CAGGTTCAAG CGATTTTCCT GCCTCAGCCT 

48301 CCTGAGTAGC TGGAACTACA GGCACGCACC ATCACACCGG CTAATTTTTT 

48351 GTGTTTTTAG TAGAGACGAC GGTTTTGCAA TGTGGGCCAG GCTTGTCTCG 

48401 AACTCCTGAC CTCAAGTGAT CCTCCAGCCT CGGCCTCTCA AAGTGCTGGG

48451 ATTACAAGTG TGAGCCACTG CACCAGGCCA AAAACTTGTA TTTCAATAGT 

48501 CATTGAGGCT GGGTGCAGTG GCTCACGCCT GTAATCCCAG CACTTTGGGA 

48551 GGCTGAGGCC AGTGGATCAT GAGGTCAGGA GATCAAGACC ACCCTGGCTA

48601 ACACAGTGAA ACCCCATCTC TACTAAAAAT ACACACAAAA ATTAGCCGGG 

48651 CATGGTGGCA AGATGCCTGT AGTCCCAGCT ACTCAGGAGG CTGAGGCAGG
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



48701 AGAATGGCGT GAACCTGGGA GGCAGAGCTT GCAGTGAGCG GAGATCGCAC
48751 CGCTGCACTC CAGCCTGGGC AACAGAGAGC GACTCTGTCT CAAAAAAAAA
48801 AATATATATA TATATATATA TATATTCATT GAGACCGACT CTGACTTAAA
48851 AGCAGTAATG AATGGTGTAG GTTTTGGTAA ATTACAGGTC TTGCTTTAAG
48901 TCCTGGTCCT CTCTTTTGCT CACTGTGTGG CCCCGGAAGA GCCATGTAAC
48951 CTCTCCAGGC TTCAGTGTCC ATTTTTAGAA CGGAGTAAGT GAATAAGCTG
49001 TGTCCAATCA TCTCTGGCCA TATCAGCTTC ATTTTTTTTT TCCTCCAGGG
49051 TCCAAACATC CCTCCACCCT CAGAGTCTTT GCACCTGGTG TTCTTGTCCT
49101 TCAAATCTCA GCTTGGATCA CCCTTTATAA AGTAGCATTT CCCCCGTATA
49151 CGCATCTTGC ACACAGCCAA TCTCTATTCT ACCTCTATGC TCACTTCCTT
49201 CCTGGCAATT ATTACTACAG CTGGGCCCTT GAACAGCATG AGGGTTCAGG
49251 GTGCTGACCC CTATGCATTC AAAAATCCAC ATATAACTTT TTTTTTTTTG
49301 AGATGGAGTT TCACACTTGT TGCCCAGGCT GGAGTGCAGT GGCGCCATCT
49351 TGGCTCACTG CAAACTCTGC CTCCTGGGTT CAAGTGATTC TCCTGCCTCA
49401 GCCTCCTGAG TAGCTGGGAT TACAGGCATG TGCCACCATG CCCAGCTAAT
49451 TTTGTATTTT TAGTAGAGAT GAGGTTTCTC CATGTTCGCC AGGCTGCTCT
49501 TGAACTCCTG ACTTCAGGTG ATCCGCCTGC CTTGGCCTCC CAAAGTGCTG
49551 GGATTACAGG CATGAGCCAT GATGCCCGGC CATTTGCTAA TGGCATCTAG
49601 TAAGTAGAGG CCAGAGATGT TGCAAAACAT CCAACAATGC ACAAAGCAGC
49651 CTCCTATCAA AACACATTAT CCAGACCAAA ATGTCAATAG GGCTGAGGTT
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



49701 GAGCATCTGC TGTACACAGA TTCCAAGTTC TGGTACAAAT CTCGTAGTTC 

49751 TCTGAGGGCT CATCTTTCAA TGCCTAGCAC ATCAAAGGAG GCCAATTTCC

49801 TCTTCCCTTT CACCTCCTGG TATGAAATGT TTCCTCCTCC ACCTTGATCC 

49851 TGTAAGAGCC CAGCTGGAGT TTGCAGACGA CGGGGAAAGA AATGGGTGAG 

49901 GGAGGGTCCT ATGGTTGAGT CTCCGCAGTG GGCCCTGGGT GCCCAGTTCA 

49951 CCCTCCTCCC CTTCATTTTC TCCATCATGA CAACTCAAGG CAAATTCTCA 

50001 GTTTCCATGG GCCAGTGGAA TCCACTGACT TCATGAAATA ACCCCACCCT 

50051 GAGCAAATAC CCCTCAAATA ATAACTGTTT ACACAACATC AGTGGCAACA 

50101 ATGACCCAAG CAGCAATGCC ACCACCAGAA TAGCAACCAT AACAGCAGCT 

50151 CATTTTCATC AAAAGGAAAC TGTAGGGCCA GGCACAGTGG CTCACACCTA 

50201 TATTCCCAGC ATTTTGGGAG GCTGAGGCAG GCAGATCACC TGAGGTCAGG 

50251 AGTTCAAGAC CAGCCCAGCC AACATGGTGA AA* TCCATCT CTACTAAAAA 

50301 TACAAAAACT AGCCAGGCTT GGTGGCATGT GCCTGTAATC CTAGCTACTC 

50351 GGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CTGGGAGGCA GAGGTTGCAG 

50401 TGAGCTGAGA TTGTGCCACT GCACTCCAGC CTGGGCGACA GAGCAAGACT 

50451 CCGTCTGAAA AAAAAAAAAA AAGGAATTGT GCCAGGAATT GTGATGAGAA 

50501 CTTTATATGC ATTATCTCCT ATTAATATTA CCCAAACCTC CGTGAGTTAC 

50551 TATACTCATT TCTACAGAGA GCATTTATGC ATCCAGGGAG GAAGTAATTA 

50601 GCCCAGAATT ACTCAGTTAT GACACAGGAC AGTATGAAAA CTCCAACCGA 

50651 AGATTGGAGA CTCATGAAAA CTCCAGGCTC CTAACTACAA GACATCACTG 
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



50701 TGGATCGTCC AAATAGAGCA AGCCCCAATC TCAGGACAGG AATGAGGCAT
50751 GAATGGCCTC TATGCTAATG ATCTAACCTA ATGCTGAATT TGTTACTTCC
50801 CTTCTGAATC CACTTGGAGA TTTCCTTTAT ATCTGACTTG AAATAGAGGA
50851 TATATACTCC TCTATCCTTG ACATAGGAGA TAATACACAG AAAGTATTTC
50901 ATTGTAGTAT CAAGTACACA TCCTGTTCTG TGTCCATAGG ATTATGACTA
50951 ATTTAGGGCA TGGCTTAACA GTGTGGTACT ATTGAATGAC AGACAGATGT
51001 CTGTTTTGTT GGATGCAGGA CAAGCCATGT AACCTCCCCA GACTTTAGTG
51051 TCCCCTCTGT GGAATGGAAT AAAAATACTA CGTGGGATTG TTCTGATAAT
51101 CAAATGAGAT AATTCAGGAA CAACCCAGAT AAATAACAGG GCTGCCCTGG
51151 GTTCTGTCTT TCCTTGTATC TCTCACAGAG CCTCAAAGGA GATGCAATCC
51201 ATGACCTAGA GAAACACTCA GGACAAATTC TCTTTTCCCC AGTTCCTTTC
51251 TTGCTCCAAT GGCAACACCA CCCCTCTCAT CCTGAAGTCT CTTGTTTTTA
51301 CCACCACACC TATTTTGCCA AATTTTCTCC AATATTCCAA ACCATATGAA
51351 ACCTTTCTTT CTTTCTTTTC TTTCCTTCCT TTCCTTCTTT CTTTCTTTTT
51401 TCTCTTCTTT TCTTTTCTTT TTGAGACATG GTCTCACTCT GTTGCACAGG
51451 CTGGAGTGCA ATGGCACGAT CTTTGCTCAC TGCAACCTCC GCCTCCCAGG
51501 TTCAAGAGAT TCTCTTGCCT CAGCCTCCTG AGTAGCTGGG ATTACAGGCG
51551 CCCACCGCCA CGCCACGCTA ATTTTTGTGT TCTTAGTGGA GACGGGGTTT
51601 CGCCATGTTG GCCAGGCTGG TCTTGAACTC CTGACCTCAA GTGATTTGCC
51651 CATCTCGGTC TCCCAAAGTG CTAGGATTAC AGGCGTGAGC CACCAAGCCC
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



51701 GGCCCCATAT GAACCGTTTC TATCCCTCAT TTCTCTGTAC TTTTACCTAA
51751 AAACACCACT CCCTTCACCC ATCACATTTT TGTCAATTCT ACATCACACA
51801 CACACACACA CACACACACA CACACACAGA GAAAGTAAGT TGGAAAAAAA
51851 TTATACTATC ATGAAATTTT GTGAAAGGAG GTAAGCTGAG AGAGTAAGAA
51901 TCAAACTAAA TTATCTTTAT GGGTAGAAAG CACACTCATC CATACATGTG
51951 TCTTTCCACC CTTGTAATGT ATTTATTATT ATTGTTTGTA TATACTAGAT
52001 TCCCAATAAA TAGGGACAGC TATTATGGTA TTTTTATTTC AGGAATAATA
52051 ATAGTGATGA TTTCCACCAT TATTGTCAAA GGACT^GCA CAAAATATGT
52101 ACCAAATAAA ATATAGCCAT TATCCTTTAT TCACAAAAGA TCTTGGCCCC
52151 ACCTCTTCTC AATGAAATGT CCATGACTTG TTCAACTTTG GCCACTCTGG
52201 GCTGAGAGAT GGAGGTTCCC TTGCGAGCTG AAGTCACACA TCGAAGGTGG
52251 AAGCCCCTCC CCTCCCTCTG GCTGGCTGAG GGATAGCCCA GATGGGCTCA
52301 TCATGAAAGT TTCCCATTAT TTCCATTTCT GGATCTACCA TCTTCCCCTC
52351 CCCTACCTCT CACCCATCAT AATTGTCCTT CTTTACTCTT TCCTCCCTAT
Exon 8
52401 CTGCA GGTTA TAACCGTCGG TACTGGACCC CTGCCACCAG CAG TGAGTAT
52451 TCAAACCTGT GATATTCCAA TGCCCTTGGG ACCCTTCCTC CCCAAGGTGC
52501 ATTCCTCAGA AGAGAAACTG ATCATTCTCC CTCCCTACGT GCCCAGCCAC
52551 AGCCTCAGAG CAGCCCCTAA CCCGTCAAGG TCTTGGTGTG AGTCAAGATA
52601 GAAGTCCAAA TTCCAATGAG CAGTTCCTGT CCCATATTCC TTTAGGAAGA
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



Exon 9
52651 CACCCAATCA TTTCTCCATG TTCTTTTTTT CTCAGCTCCA GTGACTTCTA
52701 CATTCTCCCC AGGGATTTCC ACATCCTCCA TCCCCAGCTC CACAGGTAGG
52751 AAGCTCCTCT CTGGCATCTA TGAAATTTAA CACTGCATGG TCTGTTCCCT
52801 GCTGACCACC CAGACTCAGC CTGTTCCACT CGCCCTCTCA CTCTCTCTCT
52851 CTCTCTTTTT TTTTTTTTTT TTTTTTTTTT TTTACGGAGT CTTGCTCTGT
52901 CACCCAGGCT GGAGTGGAAT GGTGTGATCT CGGCTCACTG CAACCTTCGC
52951 CTCCCAGGTT CACGTGATTC TCCTGCCTCA GCCTCCGGAG TAGCTGGGAT
53001 TACAGGTGCA CACCACCATG CCTGGCTAAT TTTTTGTATT TTTAGTAGAG
53051 ACGGGGTTTC ACCATGTTGG CCAGGCTGGT CTTGAACTCC TGACCTCAAG
53101 TGATCTACCC ACCTTGGCCT CCCAAAGTGC TGGGATTATA GGCATGAGCC
53151 ACCACGCCAG GCCCACTCTC TAAATTTTGA CCACCCTGCC TTGAGTGGTC
53201 TTCTAGCACC CTAACCTCTG TCTAACCTCG AGAGCTTTGC ACTAGCGATT
53251 CCTGGGGACC AGCTATGGTT GGTATCTTCT CAACTTTCTA ATTTTTTTAA
53301 AATTATTATT ATTATTATTA TTATTTTAAA TGGAGTCTCG CTCTGTCACC
53351 CAGGCTGGAG TGCAGTGGCA CCATCTCGGC TCATTGCAAC CTCTACCTCC
53401 CGGGTTCATG CAATTTTCCT GCCTCAGCCA GAAATTTTCT CAGTGGTCGA
53451 GATTGTGCCA CTGCACTCCA GCCTGGGCAA TGGAGCTAGG CTCCATCTCA
53501 AAAAAAAAAA AAAAAAGACG GAGGTCGGGC ATTCCTAACC CTTAACCCTG
53551 CCTTGTGATT CTGGAGTTAT GAGATAGAAC CTGGTGTCCC GTAATTAAAA
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1) 



53601 TTCCGCCTTC AGGCCTTATG TTTTGTGAGT CACAACACTG CAAACTTTTT 

53651 ACATGCTGTA GACAGGATGT TCACTCTCCA CTTCCTCACT GCTCTGCTCT 

53701 AATCAATTCA ACCATTTATG TGACATGCCT AACCCCTCTG GGCTTGTACG 

53751 TATGTAACAT GTATTACAAA GCAAGTCATT CCATGATCAA TGCTGTCACT 

53801 TTTTCTAGGT GCTTTCAAAA TTTGTTCTTC ATCATTGATT TTCAGTAGTT 

53851 TGATTACGAT GTGTCTGGGC ATGGTTTTCT TTGAGTTTAT CCTGCTTAAA 

53901 GTGTTCTCAG CTTCTTGAGT CTCAAAGTGT TTATTTTCTG CTCTGATTCT 

53951 TTCTCCCCTT CGGACCTCCA ATGAAATGAT GTTGCCCGAA GAGACCCTGA 

54001 GGTTCTGTTC ATTTTGTTAT TTATCAATCT TTTTTCCTCT CCGAATTTCA 

54051 GGTTTAATAA tTTTTTTTTT TTTTTTGAGA CGGAGTCTCG CTCTGTCGCC 

54101 ChGGCTGGAG TGCAGTGGCG CGATCTCGGC TCACCGCAAG CTCCGCCCCC 

54151 TGGGTTCACG CCATTCTCCT GCCTCAGCCT CCGGAGTAGC tgggattaca 

54201 GGCACCCGCC ACCATGCCCG GCTAATTTTT TGTATTTTTT AGTAGAGACG

54251 GGGTTTCACC GTATTAGCCA GGATGGTCTC AATCTCCTGA CCTCGTGATC 

54301 CGCCCGCCTC AGCCTCCTAA AGAGCTGGGA TTACAGGCGT GAGCCACTGC

54351 GCCCGGCCCA GGTTTAATAA TTTTTATAGA ATATTTTCAC AATCACCAAG 

54401 CCTTTTCTCT ACCAGCTCCA TTCTGCCCAT CCATTGAATT CTTTTTATCT 

54451 CAGTTACTTT ATGTTTCAGT TCGAAAGTTT CTACTTGGTT AGATAGATAG 

54501 ATGTTATATC ATATATTATA TGTTATATAA AAATATATTT ATGGTTATAC 

54551 ATATAACATA TATGTTATAT ATAGTTATTT ATATAGCCAT AACTATATAT 
Table 1 (continued)



Genomic CA125 Amino Terminal Sequence
(SEQ ID NO: 1)



54601 AGCCATATAT ATAGTTATAT ATAACCATAT ATATAGTTAC CATATAGTAA 

54651 CCACATATAT AAAACATATA TATATAGTGT CTCTCTATAT ATAGTTATAT 

54701 ATATAGTTTC TATATCTGTA ACTATATATA GTTATATATG TATGTTTCTC 

54751 TGTATATAAA TATATATATT TCTATATATA TAGTTATACA CATTATATAT 

54801 ATAACTGGGA GATGTTGGTA AAGGATGGCG TGAGGAAACC TGGAGCAGTC

54851 ATGGTAATCC TCGCTCTGGT CCGAACTCCT CAAGAGCAGG AGAAGGGTCC 

54901 TCCTCATTCT CCAGCCATGT TGACTTTGAG CAATTTACTC ATCCTCTCAG 

54951 TACCTCAGTT TCCTCACCTG CCAATTGAGG ATAATAATAT TTCATAAATT 

55001 GTTTGCAAAT GTTATATGCA ACTCTACGTA AGAACACCTA GCACAGGGGC

55051 TACCAGGGAA TTTGGTTTAA CAAATATTTA TCAGGCACCT ATTCTGGGCT 

55101 GGGCAGGGGG GATAAGATGT TGACTAAGTC AAATGCAGTC CCTCCCCTCA 

55151 CCAAGTTTAC AGTGTATTGG GCAAGACTGA AATGGAACAA GCAATTACAA 

55201 TTGACAATAA AAGACAACCA AGTTATTGAG CACTTACTAT ATGGCATGCC

55251 ATATGCTATG TATTTTTTTT ATTTTTAACT TTTCATTTTG AAATAAATAA 

55301 TAAATATAAA GTAAATAATA ATATAAATAA ATAATAAATA ACTTTTCATT 

55351 TTGAAATAAA TAATAAATAA ATTCAGGAGA TGTTGCGAAA ATAGTGTAGC

55401 ATTCCCCTGT ATCCTTCACC CAGTTTCTCC CCAATGGCTA CATCTTACAT 

55451 AACTCTAATA CAATATCAAA AGCAGGAAAC TGACATTGTT AAAATCCATT

55501 TTACTGGTTT TACACGCGTG TGTGCATATG TGAGCTTGTG TATGTGCGTG

55551 TGTGTGCAGG CATGTGTGTG CATGCACGCC TGTGTGTGCA TATGTGCATG 
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Table 1 (continued) 

Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)

55601 TGTGCATGCG TGTGTGCATG TGTGCATGTG TGTGTGCATG CGTGCGTGCG 

55651 TGCGTGCATC TGTGTGCATG TATGCACATG TGTGTGTGTC TGTGCACGTG 

55701 TGTGCATGCA TGTGTGTGTG CGTGTGTGTT GGTAGCCCTA TGCAATTTTT 

55751 ATCACATGGG CATAGCCCTA TAATCACCAC CACCATCAAG ATTCAGAACT 

55801 GTTCCATTCC CCCAAAGATT CCCCTCATGC TAGCCTTCGT AATCATGCCC 

55851 ACTGAGCCCA ACACTATTGC ATAGAATAGC TATTCTACTC TCCATCTCCA 

55901 TCTCTGTCTC TACAATTTTC TTTTGAAGAT GTTATATAAA TGGAAATGTA 

55951 CAACATGTCA CCTTTGAAAT TGGCTTCTTT TCCACTCAGT GTAATGCCCT 

56001 GGAGATGTGC TCTTTTTAAC AGTCATGTAA CCTTCCTAAT TTCCCTCCAA

56051 AATATCATTA TGCCCCTCGC CGCCTTTTTT TTTTTTTTTT TTTTTTGAGA 

56101 CAGAGTCTCG CTCTGTTGCC CAGGCTGGAG TGCAGTGGTA TAATCTCAGC 

56151 TCACTGCAGC CTCCGTCTCC OGGGTTCAAG GGATTCCCCT GCCTCAGCCT 

56201 CCCAAGTAGC CAGGATTACA AGTGCATGCC ACCACGCCTG GCTAATTTTT 

56251 GTATTTTTAG TCGAGACGGG GTTTCATTGT GTTGGCCAGG CTGGTCTCGA 

56301 ATTCCTGACC TCAAGTGATC TGCCCGCCTT GGCCTCCCAA AGTGCTGGGA

56351 TTACAGGTGT GAGCCACCGC GCCCGACCCA TATTGCCCAT TGTATTACAG 

56401 CGGAAGAAAC TGAGGTATGG ACAGGTAACA TGTCCATGGT CACTTGGCTG 

56451 GTGAGGGGCA GAGAGGAGAT TTGAAACCAA ATCTGACTCA CTAGTGTGGC 

56501 CGTAACCATG GTAACTATGT CTCTCTACCA TGTGGTCTCC TCTTTATTAA 

56551 AGGAAGGGCA AGTTCTGGGA GTTTTGGGAG TTTTGGGCTT GAGTGGGGAA 
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Table 1 (continued) 



Genomic CA125 Amino Terminal Sequence 
(SEQ ID NO: 1)



56601 GGGTAGCCAA GTAAAGCAGG TGAGAGAAGG TCTGCTTTAA GGACTGCTGT
56651 TTGATTTTTA TTGTTGTTGT TCAGTGTTCA ATGGGATTGA GTTGACTCTT
56701 TTTTCCCTTC TTGTTCCCCA AAGCATGAGA CTGTTCCGGT CCTTTTCCCT
56751 TTTAACTTCT CAGCTAGAGT TTGTTAGGGC GGGTATGGGC ACCTGGCAGA
56801 GTCTGAGACC TCAGCTTCCA GTAGGCACAC GTTCTGACCC AATACACCTA
56851 CCCTGGTCCC CTAACCTGCT TCTGGTCCCC TAACCTGCTT CTGGGCCCAG
56901 GTAATGCATT TTAGGAACAT CCCACTTTTC TCCTTACCTG GCTTTCCATT
56951 ATCCGTCCAA ACTAAAGCAC CCACCTGTCT GCTTCAGACT CTTGCTTCAA
57001 GCACTCCGTC TGGGTCCTCA GAAATTGACT TACAGTCAGT TCAGATCTGA
57051 CTCAGGCGTG GCCTTCTTTT CTCCTTCCTT GC
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Table 2 



Genomic Repeats 
(SEQ ID NO: 2)



Exon R1
1 AGCAGCCACA GTCCCATTCA TGGTGCCATT CACCCTCAAC TTCACCATCA
51 CCAACCTGCA GTACGAGGAG GACATGCGGC ACCCTGGTTC CAGGAAGTTC
101 AACGCCACAG AGAGAGAACT GCAGGGTCTG GTGAGAGCCC CGCCCACCGT
151 ACTCCTCCCT CGCCCACTTA GACAAACCAG CCCACCTCAC ACTGCCTCGC
201 CCACTGATGC CAGCCACGCC CACCTCATCC AACCCCAGAC ACCTTTCCCT
251 GCCCCACCCA CTGATTTTAG CCAAGCCCAC CTCACCCCAC CCAGCCTACT
301 GATGCCAGCC ACGCCCACCT TTCCCTGCCC CGCCCACTGA TTTCAGCCAC
351 GCCCACCTCA CCCTGGTCCA CCCCTCCAAT GCCCCACTCT TCCTGGCTTC
Exon R2
401 CCGCAGCTGT TGTTTCTCAC CTCCCCTCTC CTTCCTTGCA GCTCAAACCC
451 TTGTTCAGGA ATAGCAGTCT GGAATACCTC TATTCAGGCT GCAGACTAGC
501 CTCACTCAGG TGAGACGCTC CTTAAGAAAA ACACAGCCCA ACAGGTGAAT
551 ATGACCCTAG TCTCTGGGCT CCCTGACTCT GTTCATACTT GGAACAACTA
601 TTGCCCATGG ATACTAAGCA TCACCACCAG CAGCAGCAGA TAACTATTCC
651 TAAGACCCAA GGCACTGCAT TATGTACTTT ATATTTAATG CCTCATCAGT
701 GCTTGCAACA GCCTCATGAA GCAGGAGCAG AAGGGGAAAC TGAGGCCCAG
751 ATTAAGTGGC TTGTGCCAGG ACACACAAAG CAACTGCAGC ACTTCAGGTT
801 CTATATCCAA ACTCCTATCC CTTAGGTGGC ACTTCCTCCT CTGCCCCCAT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



851 TATGAACTTG CAGCATGTGG AAAACCCCAA TCTGACTTCC CTCTAAGGGA
901 ACTTGCCCAG AGAATCTAAG AGGGGAGGAA AGGAAGGCGT TCAGCCCTTA
951 CAGGCAGGAG GTCAGCTCCT GAGTGGCTCA GATGCAGCCA CAGAGGGCCT
1001 GGCCGGTCTG AGGGTGACTG AGAGGCACCG AGGGCACTGT CCCTGAGTGC
1051 TGGAAAGGGC AGGTCTTTTA GGGTAGACAG CGGTTGATAT CATTTCCTGC
1101 CTGGCATTCT CACCTTCCAC ACCTCTCTCA CAGAATCTCC AAGTGTGGCT
1151 CTCCCAAGAG AGAGTGTCAG TCATCTACCT CCAGCTTCCT TTCCTTCCCA
1201 GGGGGAAGAG GGGACAGGGG GGCCCTAGTG GCTAAGAGCA TTGGTGAACT
1251 CAGGCAGACC TCAGTTCTGA ACCAACCCAG CTCTGCCATT TACTATCTGT
1301 GACTCTGAGC AAGTGCCTGA AGCCTTCTGT GCCCTATTTC CTGACATATT
1351 ATATATATAA AATACATATA TTATATATAG ACATATTTTA TATACATATT
1401 GAGGCATATT TTATAAACAT GTTTATAGAC ACATTTTTAT ATGCATATGT
1451 TATATACGTA TATAACATAT GTTATATATA ATGTATATAT TATACATATT
1501 GTTATATTGT ATACATGTTA TATATGTTAT AGCATATATA GTACAAGTTA
1551 TATATAACAC ATACATTATG TTACATATAA TGTATATGTT ATATATGATA
1601 TATTATATAT AATTATATAT TATATAAAAC TGTTATATAT AATTATATAT
1651 AATATATAGT TGTTATATAT AATTATATAA TTGTTATATA TTATATACAA
1701 CATATAACAT ACATTATATA TTGTTATATA TAATATAATA TATACATATA
1751 TAACATATGT ATAACTTTTA TGTTATACAT AATGTATATA ACATATATGT
1801 GTATGTGTGA TGTACATAAC ATATCTGACA TTAACATATA ACATATGATA
1851 TAACAATATT ATATGTTATA ACATAATATA TGTTATAATA TAACAATATT
1901 ATATGTTATA ACTTATACTG TCATATGTAA CATATACATA ATATTTTATA








Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



1951 AATCAGTTTA ATATACATTA TGTTACATAT AATGTATGTT ATATATGATA 

2001 TATTATATAT AATTATATTA TACATAATTG TTATATATAA TGCATACATT 

2051 GTATTTGTTA CGTATTATAT GCAACS^TATG GGGATCCTCT AGAGTCGQAC 

2101 CAGCGGCAGC AGCTGCCTGC CTTTTNNNNN NNNNNNNNNN NNNNNNNNNN 

2151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

2201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

2251 ATATACATAC ATAACATATG TATAACTTAT ATGTTATATA TAAGTATATA 

2301 ACATATATGT GTATGTGATG TATATAACAT ATCTGACATT AACT^TATAAC 

2351 ATATGTTATA ATATGACATA TTATATATAT TACATATAAC GTATATCATG 

2401 TATAATATAA TGTGTATATA TAATATATTA AAGTATATAA GTATAAATAC 

2451 ATGTAATATT TAAATATATA TTATATATAG TATACATGTG GATACATACA 

2501 ACTTCTACAT ATACCTAGTA TATATTCTAT ATATAAACAG TCCATGAATT 

2551 ACAATGATTC AACTTATGAT TTTTCAAACT TTGTGATAAT GCCATAGCAA 

2601 TATGCATTCA GTAGAAAGCA TACCTTCAAC ACCCATGCAA CCATTCTGTC 

2651 ATTCACTTTC AGTACAATAT TCAATAAATT ATATGAGATA TTCAACAGTT 

2701 TATTATAAAA TAGGCTTTGT GTTAGGTGAT TTTGCCCACA TGTAGGCTAA 

2751 TGTAAGGGTT CAGAGCATGT TTAAGGTAGG ATAGGCTAAC CTATCATGTT 

2801 CTGTAGGTTA GGTATAGTCG ATTTTTATTT TTATTTTTAT TTTTGAGACA 

2851 GAGTCTTGCT CTGTCACCCA GACTGGAATG CACTGGTGCG ATCATAGCTC 

2901 ACTGCAGCCT TGAACTCCTG GGCTCAAGTG ATCCTCCTAC CTCAGCCTCC 

2951 TGAGTAGCTG GGACTACAGG TGTGTGCCAC CACACCTGGC TATTTTTTTT 

3001 TTAATTTTTT TTTTTTTGTG GAGAGGAGGG TCTTGCCATG TTGCCCAGGT 

3051 GGCCTTGAAC TCCTGGGCTC AAGGAATCCT CCCACCTTGG CCTCCCAAAA 

3101 TCCTGGGATT ACAGGTGTGA GCCATCACGC CCGGCTACAG GGCATTTTTG 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

3151 ACTTATGACA TTTTCAGTTC ACAATGGATT TGTCAGGGCT GGGCATGATG
3201 GCTCACACCT GTCATCCCAG CACTTTGGGA GGCTGAGGCA GGTGGATCAC
3251 TTGAGGCCAG GAGTTTGAGA CCAGGCTGTC CAAATGGCAA AATCTTGTCT
3301 CTACTAAAAA TACAAAAATT AGCCAGGCGT GGTGTGACAA CTGTAGTTCC
3351 AGCTACTCGG GAGACTGAAG CGTGAGAATC ACTTGAACTT AGGAGATGGA
3401 AGTTACAGTG AGTCAAGATC ACACCACCGC ACTCCAGCCT GGATGACAGA
3451 GCAAGACTCT TGTCTCCAAA AAACAAAAAA CAGGCTGGGT GCATGGCTCA
3501 TGCCTGTAAT CCCAGCAGTT TGGGAAGCTG AGGCAGGTTT ATCACCTGAG
3551 GTCAGTAGTT CACGATCAGC TTGGCAAACA TGGAGAAAAC CCATCTCTAC
3601 TAAAAATACA AAAATTAGCT GGATGTGGTG GTGGGTACCT GTAGTCCCAG
3651 CTACTCGGGA GGCTGAGGCA GGAGAATGGA TTGAACCTGG GAGGCAGAGG
3701 TTGCAGTGAG CCAAGATCAC ACCATTGAAC TCCAGCCTGG GCAACAGAGT
3751 GAGACTCCAT CTCCAAAAAC AAAAGAAAGC AAAAACAAAA AAATAAAATA
3801 AAAAACCTGT GTTTATCAGG ACATAATACC ATCATGAGTC AAGAAGCATC
3851 TAAATGTACA TGGTAGTTAT ATAAAAATAG TTATATAGTT ATATACAATA
3901 GTTATATATA AACCAGTTTA ATATATGTTA AGTAGAGGTA TATGGTAGTT
3951 ATATAAAAAA TAGTTATATA ATAGTTATAG AGTTATATAA TTATATAAAA
4001 TAGTTATATA TAAACCAGTT TAATATATGT TAGGTAGAGG TATAATAATA
4051 TATATTGTAT ATACTATATA ATATAGTAAT GTATAAAATG CAAT^CGATA
4101 TCATATATTT CTATATTAAG TTTATATTTA CAGATCTACA TTTTATATAT
4151 TTTATGTTAT ATACAATTGT GTTATACATA ATATAATTAG TATAGTACTG
4201 ACTTGGGGAA TTGAGCAGTA CCAACCCATA GGGATGTTTG AGGATGAAAA
4251 TATGTGATTA TGAATACA/^ ATGCTGGGCC TGCTGCATAG GAAGTATTTA
4301 ATAAATGGTA GTTGTTACTA TAAAGTCGTT CCTACTATAG AGCTACTCAC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



4351 AACCTGGGAC ATAGGGAAAG AGCCCGTTTC CCTCTAATCA CTCAATAGTG 

4401 GGTGGCTAGG TAGGTGAGTC CACATCCTGT GGCCGGGAAC AGGTGCTGAG 

4451 ACATGAAGAC CTTCTGACTG CATGTTGGAC CAGCCACAGT TTCAGACGGA 

4501 CCAGCCAAAA AGGGCATTTT CCCCAAGCCA TTTAGCTCCC TTGAGTCTCA 

4551 TAACAAATCT CCTAGACCCT GCTGGTCCAT AGGATCTAGA GAGGATGACT 

4601 TGAACCTTCT GATCCCACCA TTTGAAAACG CCATGCCATG GGCACCAGTA 

4651 GGAGGGCCAC TGCTACGTGC ACCAGTACAA GGGCCACTGC CATGGATTAC 

4701 AGATTAACCC TAAGTATAGC TGTCGCACAC CTAGTACTTC AGGAGGCTTA 

4751 TTCGGGGCCA TGCAGATCCC TGGCATTATT ATCCTAGGAT CCTACACCAA 

4801 GCAAAGCAGG AGCTGCCCCT CCTCATAAAC CCATAAGCCC TCCTCTTGAG 

4851 CAAAGCAGCT GGGAAGGCCA GAAGTTATTC AAGCTCCCCT CTGCCCCGGT 

4901 TCCAAAGACA GACAGCTCAA GCCTACATGC AGCAAACCCT ATAT^GTGT 

4951 CACCTCTTGG CATTTCTGCC ATGGTAATGC TTTCTGCTTC CACTAATAAT 

5001 CCTAGTAATT TGTTTATGGT GGGCATCTCT CTGATGAGAA CCACATTCTT

5051 TTTTTTTTTT TTGAGATAGA GTCTCACTCT GTTGCCCAGA

5101 CTGGAGTGCA GTGGCGCGAT CTCGGCTCAC TGTAACCTTT GGCTCCTAGG

5151 TTCAAGCAAT TCTCCTGCCT CAGCCTCCCA AGTAGCTGGG ACTGCAGGCA

5201 CGTACCACCA TGCCCAGCTA ATTTTTGTAT TTTTAGTTGA GACGGGGTTT

5251 CACCATGTTA GCCAGGATGG TCTCAATCTC TTGACCTCAT GATCCACCTG

5301 CCTTGGCCTC CCAAAGTGTT GGGATTACAG GCATGAGCCA CCATGCCTAG

5351 CCTGAGAGCC ACATTCTTGT TAACCACAAT TTTCTCAGAG TCTGCATTAG

5401 GGGTTGACAA AGAGTGGAAA GGAAGGACAA AAGGATGGAG AGGTGGATGG
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



Exon R3 

5451 
5501 
5551 
5601 
5651 
5701 
5751 
5801 
5851 
5901 
5951 
6001 
6051 
6101 
6151 
6201 
6251 
Exon R4 
6301 
6351 
6401 
6451 
6501 



ACTAAGCATA TGTAGGTTCT TACCCAGGCC AGAGAAGGAT AGCTCAGCCA 



CGGCAGTGGA 


TGCCATCTGC 


ACACATCGCC 


CTGACCCTGA 


AGACCTCGGA 


CTGGACAGAG 


AGCGACTGTA 


CTGGGAGCTG 


AGCAATCTGA 


CAAATGGCAT 


CCAGGAGCTG 


GGCCCCTACA 


CCCTGGACCG 


GAACAGTCTC 


TATGTCAATG 


GTGAGCAGCT 


GTGATGTGGT 


TGGAGGCTCT 


TCCTCCTTGC 


TGAGCAGCCT 


GTAATCACTG 


GCTTGAGGTC 


ACACTCACTG 


TCAGGCAATT 


GAAAATTTGG 


TCCTGTGCTC 


TACATGGGAT 


GACTAATTTC 


CGGACTTCAT 


GGTATCTTTT 




^rpT^TTTTG 


AGATGGAGTC 


TCGCTCTGTC 


ACCAGGCTGA 


GGTGCAGTGG 


CATGATCTCA 


GCTCACTGCA 


ACCTCCGCCT 


CCCGGATTCA 


AGCAATTCTC 


CTGCCTCAGC 


CTCCTGAGTA 


GCTGGGACTA 


CAGGTGCATG 


CCACCACACC 


CAGCTAATTT 


TTGTATTTTT 


AGTAGAGACA 


GGGTTTCACC 


ATGTTGGTCA 


GGATGGTCTC 


AATCTCTTGA 


CCTTCTACTC 


CACCTTGCCT 


TGGCCTCCCA 


AAGTACTGGG 


ATTACAGGCT 


TGAGCCACCA 


CACCTGGCCA 


GGACTTCATG 


GTTTCTTCAT 


CATCATGGAA 


TGAATTCCAT 


CAGGGCATTC 


TTCCCTGATG 


TGAGGGCACT 


GATAGGAAAT 


CTTTAATGGT 


CCCTGCTGCA 


TGAAACTGCT 


TCCATTGCAC 


CAGGGTAGCC 


CTGACCCCTA 


TTTGGTCCCC 


CACATCTCCT 


TGTAACTTAC 


CCACACTCCT 


CCCTCCTTCT 


CTGTGCAGGT 


TTCACCCATC 


GAAGCTCTAT 


GCCCACCACC 


AGCAGTGAGT 


ATTCAACTCA 


TGTCCACATG 


CCCATGATCC 


TACACCAAGC 


AAAGCAGGAG 


CTGCCCCTCC 


TCATAAACCC 


ATAAGTCCTC 


' CTCTTGAGCA 


AAGTAGCTGG 


GAAGGCAGAA 


GTTATTCAAG 


CTCCCCTCTG 


CCCCAGTTTC AAAGACAGAC 


TCAGCTCAAG 



CCCACATGCA GCAAACCCTA TAAAAGTCTC ACCTCTTGGC ATTTCTGCCA 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



6551 TGGTAATGCT TTCTGCTCTC ACTAATGAGG ACTTCTCCTC A GCTCCTGGG 
Exon R5

6601 ACCTCCACAG TGGATGTGGG AACCTCAGGG ACTCCATCCT CCAGCCCCAG 

6651 CCCCACGAGT AAGTACCAGT CAATGGCATC TCTATTAGAG CATGCTATCT 

6701 CTGTCATTTT TACTCAGATG AAGATGGAAA ATCATASCAA ATCTACTGAT

6751 AGTGAGTGGA CCAACGAAAT TTGTTGGCCA CCTAGTGTGT ACCAGATCCT 

6801 AGAGATACAG GAGGGAAAAC AAAACCAATA CAAAATTTCT GCTCTCAGTG

6851 AGCTTGTATT CTTGTCATGA TGATGATGTT GGTGGTGGTG CTGTTGATGA 

6901 CGATGATGAT GATGATGATG ATGATGATGC TGGTGATACT GTTGATGGTG 

6951 ATAGTGATGT TGATGACAAT GATGATGATG ATGATGTTGA AGAAAATGAT 

7001 GCTGGTGATG GTGGTGGGGG TTATTATGGT AATAATGATA TGTTGAGTGT 

7051 GACGATGATG GTGGTGGTGT TGATGATGAT GATGATTATT ATGCTAGTGA 

7101 CATTGATGAT GGTAATGGTG ATATCAACGA CAGTGACAAT GATGGTGATG 

7151 AGGATGATGT CGGTGATGGT GGTGGGGTTA TGATGGTAAT GATATGTTGA 

7201 ATGTGATGAT GGTGATGATG ATATTTGTGG TTCATGATGG GGATTGTCAT 

7251 GGTGGTGGTG GTGGTACTTG TGATGACAAT AATGATAATA ATGATGACAA 

7301 TGATAGTGAT GATGGTGATG GTGATAATAA AGATAACAGA TATCACCTTA

7351 CAATATTGAG CACTAAATAT GTACCAAGAG CTATGCTCAG TATCTAACTA 

7401 CTATTATATA ATCTACTTTA GAAAATGAAT TGTATCATAG ATAAGAAAGG 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

7451 CGTGGAAAAT ATTTATTATG TCACTCAATT TAATTGCTGC ATATGGTTAT
7501 TACAAAGTGC TATTCTCTCT ACTTTGAACA TAATGTTTAT TTCACACTCC

Exon R1
7551 CACTAT AGCT GCTGGCCCTC TCCTGATGCC GTTCACCCTC AACTTCACCA
7601 TCACCAACCT GCAGTACGAG GAGGACATGC GTCGCACTGG CTCCAGGAAG
7651 TTCAACACCA TGGAGAGTGT CCTGCAGGGT CTGG TTAGTG TCCTGCCCTC
7701 CACACTCTGC CCTGCTCATG ATACCCAGTC CCTCTTACAT CATCCATGCC
7751 AGGGCAATGG AAGAATATCA AACCCAACTC ACTTTTGCCC CAAGAGATGC
7801 AAGCCTCAGC CAGGAGCGGT GGCTCACGCC TGTAATACCA GCATTTGGGA
7851 GGCCAAGGCG GGTGGATCAC CTGAGGTCAG GAGTTTGTGA CCAGCCTGGC
7901 CAACATAGTG AAACCTCATC CCTACTAAAA TACAAAAATT AGCCAAGCAT
7951 GGTGGTGCAT GCCTGTAATC CCAGCTACTT GGGAGGGTGA GGCAAGAGAA
8001 TCACTTGAAT CAAGGAGGCA GAGGTTGCAG TGAGTCAAGA TCATGCCACT
8051 TTACTCCAGC CTAGGCAT^ AAGCGAT^CT CCATCTCACA AAAAAAAGAA
8101 AAAAAGAGAG AGATGCAAGC CTCCCCCACC AAGGCCAGCC CTGCCCACCT
8151 CACTTCTGCC TGGCTCTTAC ATAAAACTTA GCCCTCCTAC TCACTGCCCT

Exon R2
8201 CTCCCTCCTC CACA GCTCAA GCCCTTGTTC AAGAACACCA GTGTTGGCCC
8251 TCTGTACTCT GGCTGCAGAT TGACCTTGCT CAGGT GAGAA CTTAGAATTT
8301 CCAGCCTGGC TGCCCCACTT GTACTCACTC CAAAAGACTT TGCACTGCTT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

8351 CCTTGCTGCA CTTCCTAGGG ATATCCTCAC CAAAGGTGGA ATTCAGGAGT
8401 CACAGGCTTC AGGATCAGTG TGTTTCCTGA CAGTAACACC CCTACACTCC
8451 ACCTCAACAG AGAGAATCTG CATGGCCCAT CATCAGGATT GAGCCTCTCC
8501 CTTTATCATC CCTCTGAATT CCCTCCATTC CCTGTGCCTC CCTTTCCTTT
8551 ACATGTTAAA TTCTGTCCCC AGGATTTCTT TCAGGACAAT CATGCCTTAT
8601 CCACGTGATT TCATCCTCAT TTCGAGCTCT TCACTGGGCT CAAGTCCGGC
8651 TCCCCGTCCC GTCCATGAAA GTGTCAGTTT CATCTTGTCA CTGTATCCGT
8701 GACTCCACTC ACAGTCCTCA GCAAGCCAAT AGTCCATGCA CTAAGAGTCG
8751 ATGTGGCTTC TCACCTCTTT CCCAGGTTTC TCATTTCTCT GGTCCTTGCT
8801 GTCCTTCCCT CAGCAATCGC AAGACCCTTC CTAGATAAAC TTTTCATTGT

Exon R3
8851 GATTTTTCCC ACTGACCCTC CCCAGGCCCG AGAAAGATGG GGCAGCCACT
8901 GGAGTGGATG CCATCTGCAC CCACCGCCTT GACCCCAAAA GCCCTGGACT
8951 CAACAGGGAG CAGCTGTACT GGGAGCTAAG CA/^ ^vZTGACC: AATGACATTG
9001 AAGAGCTGGG CCCCTACACC CTGGACAGGA ACAGTCTCTA TGTCAATGGT
9051 GAGTGGCTGT GATGTGGTTG AAATCTCTTC CCCCTTGCTG GGCAGCCTCT
9101 AATCTCTAAC TAGAGATCAC ACTCCCTGCC TGGCCTTTGA AAATTCTGTC
9151 ATGTGCTCTA CATGGGATGA CTAAGGTCTG GACTTCATGG TTTCCTTACC
9201 ATCATGGACT GTGTTCCCTC AGGGCATTCT TTCCTGATGT GAGGATGCTG
9251 ATAGAAAATC TTCAATTGTC CCTGTACCAT GAAACTCGGT TCATTGCACC
9301 AGGGTAGCAT TGACCTCCAT TTGGTCCCCC ACCTCTCCTT GTCTCTTACC







Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R4
9351 CACTCTCCTC CCTCCTTCTC TATGCAGG TT TCACCCATCA GAGCTCTGTG
9401 TCCACCACCA GCAG TGAGTA TTCAACTCAT ATCCACATGC CTCGGTTCCT
9451 ACACCAAGAG GAGCAGGAGC TGGCCCCTCC TCATAAACCC ATTAAGTCCT
9501 CTTCATAAGC AAAGGATTTA GGAGGGCAGA AGTTATTTAA GTGTCCCTCT
9551 GCCCAGCTCA AGAGACCGAC CCAGCTCAAG CTACACATGC AACAAACCCC
9601 ATAAATAGTC TCCCCTCTTG CCATTTCTGC CAAGAGAGTG CTTTATGCTT

Exon R5
9651 TCACTGATGA GAACTTTTCC TCA GCTCCTG GGACCTCCAC AGTGGATCTC
9701 AGAACCTCAG GGACTCCATC CTCCCTCTCC AGCCCCACAA GTAAGTATCA
9751 GTCAATGACA TCTCTATGAG AGCATACCTG ATTAGTGTAA ACATCTCTGT
9801 CATTTTCACT CAAATAT^GA TGGAAAATCA TAGTAAATCT AGTGATACTG
9851 AGTGGACAAA TTTGTTTGTT TGTTTTTTCT CATCCTTTTC ACTTTTTTTA
9901 TTATACTTTA AGTTTTAGGG TACATGTGCA CAATGTGCAG TTTAGTTACA
9951 CATGTATACA TGTGCCATGC TGGTGTGCTG CACCCATTTG CTCGTCATTT
10001 AGCATTAAGT ATATGTCCTA TGCGATCCAA GCCCACGCGC CGCACCACGT
10051 GCAACAGTTT CACAGATTGG ATGGTCCGAT ANNNNNNNNN NNNNNNNNNN
10101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
10151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2) 



Exon R1
10201 CTTCACCATC ACCAACCTGC AGTATGAGGA GGACATGCAT CGCCCTGGAT
10251 CTAGGAAGTT CAACACCACA GAGAGGGTCC TGCAGGGTCT GGTTAGCACC
10301 CTGCCCTCTT CACTCTCCCC CGCCCTGGAT GCCGAGCCCC TCATACAACA
10351 TTCATGCCAG GGCAATGGAA GAATATCGCA CCAACCTTGC CCTCATCCCC
10401 AGAGATGCAA GCCTCACCCA CTGAGGCCAG CCACTCTCAT GGGTGTCTGC
10451 CCCACCCACC TCACTTTTGT CCCCACACAG GGACCTTAGC CCTCCTACTT

Exon R2
10501 ACCTCTCTCT CCCTCCCCCA CA GCTTAGTC CCATATTCAA GAACACCAGT
10551 GTTGGCCCTC TGTACTCTGG CTGCAGACTG ACCTCTCTCA GGT GAGACCT
10601 TAGAAGATCC AGCCTGGCTG CCCCAGTTGT TCCCACTCCA GTAGATTTTG
10651 CTCTGCTTCC TTGCTGCACC TCCTAGGGAT ATCCTCACCA AAAGGGGAAT
10701 TCAGGAGTCA CTGGCTTCTG GACCAATGTG TTTCCTGATA GTAACACTCC
10751 CACACCTCAC CTCAACAGGG AGAATCTGCA TGGTCCATCA TCAGGATTGA
10801 GCCTCTATCC TGATCATCCC TCAGAATTCC CTGCCCCTCC CTTTCATTTA
10851 GGTGTTAAAT TCTGTCCCCA GAATTTCTCT CAAGACAATC ATGCCTCATC
10901 CAAGTGCTTT CATCCCTGTT TCTAGCTCTT CACTGGTCTC AAGTCTGGGC
10951 TCTCCTGTCC CCATGCTATG AGAATGCAGG TTTCACCTTG CACTTTTATA
11001 AGCATGGTTG TATCTGTGAC TCTGTGCACA GTCCCAAGCA AGCCAGTAGT
11051 CCATGCACTC AGAGAATCTA AGTGTAGCTT CTCACCTCTT TCCCAGGTTT
11101 CTCATTTCCT CTGGTTCTTT ACTGTCTTTC CATCAGCAGT CTCAGGACAC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R3
11151 AACCTAAGTA ATCTTTTCAT AGTCATTCTC CCCACCTACC TTCCCCAGGT
11201 CTGAGAAGGA TGGAGCAGCC ACTGGAGTGG ATGCCATCTG CATCCATCAT
11251
11301 GAGCCGACTG ACCAATGGCA TCAAAGAGCT GGGCCCCTAC ACCCTGGACA
11351 GGAACAGTCT CTATGTCAAT GGTG AGCAGC TGTGATGTGG TTGGAGTCTT
11401 TTCCTTCTAG AGTCTGGAAA GAATCTAATC TGTGGCTTGA AGTCACACTC
11451 CCTGCCTGGC CATTGAATAT TCTGTCATGT GGTGTAGATG GGATGACAAA
11501 GTTCTGGACT TCACAGTTTC TTCATTGTCG TGAACTGTGT TCCCTCAGGG
11551 CACTCTTCCC TGTTGTGAGG ATACTGATAG GAATTCTTTA ATGGCCCCAG
11601 TCCCATGAAA CTCATTGTCC CATGAAACTC ATTTAATTGC ATTGGGATTG
11651 CCATGACCTT ATTGTGTCCC TCGTATCTCC TTAACGCTTA CCT^GTCTCC

Exon R4
11701 TCCCTCCTTC TCTATGCAGG TTTCACCCAT CGGACCTCTG TGCCCACCAC
11751 CAGCAG TGAG TATTCAACTC ATGTCCACAT GCCCCTGATC CTACATTAAG
11801 TGGAGCAGGA GCTGGCCCCT CCTCTTAAAC CCATAAGTCC TCCTCTTGAG
11851 CAAAGGAGCT GGGAAGGCAG AAGTTATTGA AGCTCCCTTC CACCTAGCTC
11901 CAAAGACAGG CCCAGCTCAT GCCCGTATGC AGCAGACCTC ATAATAGTCT
11951 ACCTTCTTGC CATTTCTGCC ATGAGATTAT TTTCTGCTTT CACTGATGAG

Exon R5
12001 CACTTTTTCT CAGCT CCTGG GACCTCCACA GTGGACN NNN NNNNNNNNNN
12051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
12101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R1
12151 ATTTTCAATT CCCACTA CAG CTGCTGGCCC TCTCCTGGTG CTGTTCACCC
12201 TCAACTTCAC CATCACCAAC CTGAAGTATG AGGAGGACAT GCATCGCCCT
12251 GGCTCCAGGA AGTTCAACAC CACTGAGAGG GTCCTGCAGA CTCTGG TTAG
12301 TGCCCTTCCC TCCTCACTCT GCCCAGCCCC AGATATCCAG TCCCTTCTAC
12351 ATCATCCATG CCAGGGTGAT GAAAGAAGAT AGCAACAACT TCCCCCCTTC
12401 CCCCCAAGAG ATGCAAGCCC CACCCACAGA GACCAGTCCT GCTTATTGGT
12451 GCCTGCTCCA CCCACCTCAC ATCTGCCCCG ACACACACAC ACCTTAGCCC

Exon R2
12501 CACTACTCAC CTCCCTCTCC CTCCTCTACA GCTTGGTCCT ATGTTCAAGA
12551 ACACCAGTGT TGGCCTTCTG TACTCTGGCT GCAGACTGAC CTTGCTCAGG
12601 TGAGACTTTA GAAGAGCCAG CCTGGGTGCC CAAACTTGTT CCCACTCTAA
12651 AAGACTTTGC ACTGCTTCCT TGCTGCACTT CCTAGGTATA TCTTCACCAC
12701 AAGGGGAATT CAGGAGTCAT TGGCTTGAGA ACCAGTTGTT TCCTGATAGT
12751 AACACCCCCA TGCCCCAACT CAACATGCAA AATCTTCATG GTTCATCATC
12801 AGGATTGAGA CACTACCCTG ATTACCCATC TGAATTCCCT CCTTTCCCTG
12851 ACCCCTCCCT TTCATTTAGG TGTTAAATTC TGTCCCCAGG ATTTCTCTCA
12901 AGATAACCAT GCCTCATCCA CATACATGCA TCCGCCTTTC AAGCTCATCA
12951 CTAGTCTGAA GCTCTGGGTT CTCCTGTTCC CATGCCATGA GAATGCAGGT
13001 TTCACCTTGC ACTTTTATAA AAATTATTAT ATCCATGACT CTGCTTGCAG
13051 TCCCAGACCA AGATAGTGGT CTATGTACTC AGATAATCTA AGTGCAGATT
13101 CTCACCTCTT TCCCAGATTT CTCATTTCCT CTGGTTCCTT GATATGTTTC
13151 CCTCAGCAAT CTCAAGACAA GTCCTAGGCA ATCTTTTCAT TGTCATTCCC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R3
13201 CCTCCTACCT TCCTCAG GTC CGAGAAGGAT GGAGCAGCCA CTGGAGTGGA
13251 TGCCATCTGC ACCCACCGTC TTGACCCCAA AAGCCCTGGA GTGGACAGGG
13301 AGCAGCTATA CTGGGAGCTG AGCCAGCTGA CCAATGGCAT CAAAGAGCTG
13351 GGCCCCTACA CCCTGGACAG GAACAGTCTC TATGTCAATG GTG AGCAGCT
13401 GTGATATGGT AGGGGTCTCT TCCTCCTGGC TGTGCAACCA TCTAATCTCT
13451 GGCTTGGGGG CACACTCCCT GCCTGGCCAT TGAAAATTCT GTCACGTGCT
13501 CTACATGGGA TGACTAAGTT CTGGACTTCA TGGTTTCTTT GTTATCATGA
13551 GAGGCATTCC CTCTGGGCAC TCTTCCCTGT TGTGAGGATG CTGATAGGAA
13601 ATCTTTAATG ACCCCTGTCC CATGAAACTC ATTTAATTGC ACCAGGGTAG
13651 TCCTGAACTC TATCGCGTCC CCCACATCTC CTTAACCCTT ACCCAGTCTC

Exon R4
13701 CTCCCTCCTT CTCTATGCAG G TTTCACCCA TTGGATCCCT GTGCCCACCA
13751 GCAGCAG TGA GTATTCAACT CATGTCCATG ATGCCCCTGA TCCTACATCA
13801 AGTGGAGCAA GAGCTGGCCC CTCCTCTTTA ACCCATAAGT CCTCCTCTTG
13851 AGCAAATGAG CTGGGAAGGC AGAAGTTACT CAAGCTCCCC TCTGCCCCAG
13901 CTCCAAAGAC AGACCCAGCT CAAGCCCACA TGCAGCAGAC CTCATAATAG
13951 TCTATCTTCT TGCCATTTCT GCCATGAGAG TGCTTTCTGC TTTCACTGAT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R5
14001 GAGGACTTTT TTCAGCTCCT GGGACCTCCA CAGTGGACCT TGGGTCAGGG
14051 ACTCCATCCT CCCTCCCCAG CCCCACAAGT AAGTACCAGC CAATGGTATC
14101 TGTATTAGAT CATGCCTGAT GAATGCAAAC ATCTGTGCCA TTTTCAGTCA
14151 AATGAAAATG GAAAATCATA ATAAATCTAG TGATACTGAG TGAACCAAAA
14201 AAAATGTATT GGCCACCTAC AGTGTACCAG ACCCTAGGGA TATAGCAAGG
14251 AAAATAGAAC CAATAAAAAC ATCTCTGCCC TCAGTGAGCT TGTGTTCATG
14301 TGATGATATG ATGGTGGTGG TGGTGGTAAT AGTAATAATG ACATATTCAG
14351 TTTGATGATA ATTTATGATT ATGGTGTTGC TGTTGATGAT GGTGGTGGTG
14401 ATGTTACTGA CAATGATGGT GACGGATCTT TGAGGATATT GTCCGTGATG
14451 GTCGTGAAGA TTATGATGAT AATGATGATG TGTTAAGTGT GATGATGATG
14501 ATGATCTGTG GTGATGCTGT TTAGGATGCT GTTCCGTGGT ACCGATGATA
14551 TTGATGTTGG TCGTGGTTAT GTTGTATGAC AATGACAATG ATGGTGATGA
14601 GGATAATCGC CAGTGATGGT GTGGGTTTAT GATGATGATG ATGTGTTGAA
14651 TGTGGTGATG ATAATGTTCG TGGTGGTCGT GATGGGCATT ACTATGGCAG
14701 TGATGGTCAT AATAATGATG GTGATGGTGA CAATGATAGC AAGGATGATG
14751 ATGGCAATAA AGATAGTACA TAACATCAGA CAATATTGAG CTCTGAATAT
14801 GCACCACGAG GAGTGCTCAG CATCTAAATA CTATTATATA ATATATTTTT
14851 GTAAAAATAA ATTGTATTGT TTTAGGCAAG GGAAGCATGG TAAATATTTT
14901 GTCACTCAAT TTAAATTCTG CATATGTTTA AAGATAAGTC TATTGCS^C
14951 TCCTATTTTC TCTACTTTGG ACATAGTGTT TGTTTCCCAC CTCCACTACA
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



Exon R1

15001 GCTGCTGGCC CTCTCCTGGT GCCATTCACC CTCAACTTCA CCATCACCAA 

15051 CCTGCAGTAC GAGGAGGACA TGCATCACCC AGGCTCCAGG AAGTTCAACA 

15101 CCACGGAGCG GGTCCTGCAG GGTCTGG TTA GTGCTCCACC CTCCTCACTC 

15151 CGCCCCACCC CAGAGAGTCA GTACCTCCTA CATCATCCAT GCCAGGTGAT 

15201 GGAACAAGAT CATACCCACC TCACCCTTGC CCCAAGAGAT GCAAGCCATG 

15251 CCCATTGAAA CCAGCCCCAC TCACTGATGC CTGTTACTGC CCCACCTGAC 

15301 TTCTGCCCTA CACACCCACA CACGCAACTT AGCCCTCCTA CTCATCTCCT 
Exon R2 

15351 TCTCCCTCCT CCACA GCTTG GTCCCATGTT CAAGAACACC AGTGTCGGCC 

15401 TTCTGTACTC TGGCTGCAGA CTGACCTTGC TCAGGT GAGA CCTTAGAAGA 

15451 TCAAGCTTGG CTGCCCCACT TGTTNNNNNN NNNNNNNNNN NNNNNNNNNN

15501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

15551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
Exon R1

15601 NGTGTTAGTC TACTTTTGAA CACTGTTTAT TTCCCATCTT CACTATAGCC 

15651 GCCAGCCCTC TCCTGGTGCT ATTCACAATT AACTTCACCA TCACTAACCT 

15701 GCGGTATGAG GAGAACATGC ATCAGCCTGG CTCTAGAAAG TTTAACACCA 

15751 CGGAGAGAGT CCTTCAGGGT CTGG TAAGAG CCCCACATAC CTCATTCTAC 

15801 CGCCACTCAC CATGTTTAGT CCTGCCCACC TCACCTATTG CAGAGCATGG 

15851 AAGATCTCAT CTACCTCATC TTGCCCCCAG ATATGCATAC CCC?ACCACT 

15901 GATGCCAGCC CCACCAACTG TTGCCAGCCC TGCCCACCTC CCTTCTACCA 

15951 CACCCCTATG ACTTCAGTCC TCCCACTCAC CTCCCTCTCC CTCCTCCACA 
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Table 2 (continued) 

Genomic Repeats 
(SEQ ID NO: 2)



Exon R2

16001 GCTCAGGCCT GTGTTCAAGA ACACCAGTGT TGGCCCTCT G TACTCTGGCT 

16051 GCAGACTGAC CTTGCTCAGG TGAGAACTGA GAACAGCCAG TCTGACTGAT 

16101 CTGAGCAGTT TGACCTGCTT CCCTTCTGCA CTCCCTGGAG ATGTCCGCAG 

16151 CCAGGTGGAA TCCAGGAGGC AGTGGCTCTA AGACCAATGT GCTTCCTGTT 

16201 CCCACCACCT CCCACCTCAA CTGAGAGATG CAGAGCCCAT CAGCAGGACT 

16251 GAGCTTCTAC CTTGGTCATC CCTCTGAATT CCCTCCTTTC CCCTACCTGC 

16301 CTTTCCACAA GTGGTTCAAT TCTGTTCCCA GGATTTCTCC CAAGAAAAAC 

16351 ATGCCTCGTC CACTTGCTTT CATCCCCAAA CCTAGCTCTT CACCTGTCTC 

16401 AAGTATGAGT TCTCCTTACC CCATGCTACA AGAATGCAGT TTCCACTTTG 

16451 CAATTTTATA AAAATCCTTG CATCCATGAT TCTGCTCATA GTTGCTAAGA 

16501 GTCAGTGCAG TCAGAGAATG GAAGTATGGC TTCTCACTTC TCTACCAGGC

16551 TTCTCATTTC CTCTGGCCCC CTCCTGTCCT GCCCTGTGGG ATCTCAGAAC 

16601 CCCTCCCTAG GCAATCCGTG TATTGTCTTT CC -CAATCTT GCCCTCCCCA 
Exon R3 

16651 G GCCCAAGAA GGATGGGGCA GCCACCAAAG TGGATGCCA T CTGCACTTAC 

16701 CGCCCTGATC CCAAAAGCCC TGGACTGGAC AGAGAGCAG C TATACTGGGA 

16751 GCTGAGCCAG CTGACCCACA GCATCACTGA GCTGGGCCC C TACACACTGG 

16801 ACAGGGACAG TCTCTATGTC AATGGTG AGT AGTTGTGATG TGGTTGGAGT 

16851 CTCTTCCTCC TTGCTGGGCA GCCTCTACTC TCTGCCTTGA GGTCACGCTC 

16901 CCTGCCTGGC TATTGAATGC TCATCCATGT TGTCTGTATG TGATGGCTGA
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

16951 GGTTGGAACT TCATGGTTTC TATTTCATCT TGGACTGAGT TCATCCTCAG
17001 GATCTGCTTT CTGGATCTGA GGGTGCTGAT AGAGAATCTT CAATGGTTCG
17051 TGTTCTGGGA AATTCCTTCC ATTGCACCAG GGTACCCTGA CCCCTATATA
17101 GTTCCCCACC ACTCCCTTAA CCCTTACCCA CCCTCTTCCC TCCCTCTCTA

Exon R4
17151 TGCAGG TTTC ACACAGCGGA GCTCTGTGCC CACCACTAGC AG TGAGTATC
17201 CACTGATTTC CAGTGCTCCT GATCCTACAT CATGCAGGGC AAGAACTGAC
17251 CCCTCCTCAC ATGCCCCTAT GTCCTCTATG AGCAAAGGAG CTGGGACAGC
17301 ACAAGTTACT CCCTTTCCCT TCTGGCCCAA GTCTCTTCAG AGAGAGACCC
17351 AGCTCAAGCC CCACATGCAG CAAGGTCCAT AAATACTCCT ACCTGCTGGC
17401 ATTTCTGCCA TGAGAGGGTT CAACACTTTC ACTAATGAGG CCTTCTCCTC

Exon R5
17451 A GTTCCTGGG ACCCCCACAG TGGACCTGGG AACATCTGGG ACTCCAGTTT
17501 CTAAACCTGG TCCCTCGG GT AAGTACAAAT CAATCGCATC TCTGTTAGAG
17551 CATGCCTGAT GACTGTCAAC ATCTCTGCCA TTTTCACTTA AATAAAGATA
17601 AAAAATCCTA GTGAATCTAC GGATGAGGAG TCATCCAGCA AACTTAATTG
17651 AGTGCCTAGT TTCTGCAGGG CTCTAGGGAT AAGAAAGGGG ACACAAAACA
17701 GTTAAAAATA TCTGCTGCAA GAAAGCTTAT TTTATTGTGA GGGTGATGGG
17751 AGTTGGTGGT GGTGAAGTTA CTGGAGATGA TGACAATAAG AATGGTGATG
17801 CTAGTGATGA TGATGGTGAT AAGGATGATA ATTATGAAGA TGGTGGTGGT
17851 GATGATGATG ATGGTNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
17901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
17951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
18001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



18051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

18451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 



Exon R1

18501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NGCTGCCAGC CCTCTCCTGG

18551 TGCTATTCAC TCTCAACTTC ACCATCACCA ACCTGCGGTA TGAGGAGAAC

18601 ATGCAGCACC CTGGCTCCAG GAAGTTCAAC ACCACGGAGA GGGTCCTTCA

18651 GGGCCTGN NN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

18701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

18751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

18801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

Exon R2

18851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNN NCTC AGGTCCCTGT

18901 TCAAGAGCAC CAGTGTTGGC CCTCTGTACT CTGGCTGCAG ACTGACTTTG

18951 CTCAGGN NNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNAGAAT

19001 TCAGTCGACC TACCGGCTTT GATGATTGCT CAGTTGAACT TAGAAATGCA

19051 CTGTCTGCCC AATGGTCCAG TCTCATGAGT GTGACTCTTT TCTGCCTCTC

19101 TTGGGTATCT GATCAAGATG GACTCAGGAA AAGTGCTCCA GATAACTGTC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

19151 TCCAATATAA CACTGCCCCT GCCATCACAC CCAAATGACT GGAAGTTTCA
19201 CAGGGTCATC AGCAGGGATT GGACTTCCAC CCCGGCCATC CCTCTGAATT
19251 TTCCCTCTTT TCTCCCCACC TCCCTTGCCC TTAGGTGTTA AAATTCTCTA
19301 ACTAAGATTT CTCTCAAGAC AAATGTGCCT CATTCACTTG TTTAATTCCC
19351 AATTCCAGCT TGTCACCTGT CTCAAGTCTA GGCTGTCCTG TCCCCATGCC
19401 ATGAGAATGC AAGAACCACA CTGAAATGTT AGAAAAATTC TTTTATCCAC
19451 AAGTATGCTC ACCGTCCCAA GCTGGACAGT AGTCAGTGCA CTCAGAGAAT
19501 CTAAGTGTGG CTTCTCATCT GTGTACCAGG CTTCTCATTT CCTGTGGGCC
19551 CTTCTTGTCC TTCCCTCCGC AATCTTGGGA CTCCTCCCTA GACAAAACTT

Exon R3
19601 TATTATTATT CCCCTCACCT GCCCTCTCCA G GCCTGAAAA GGATGGGACA
19651 GCCACTGGAG TGGATGCCAT CTGCACCCAC CACCCTGACC CCAAAAGCCC
19701 TAGGCTGGAC AGAGAGCAGC TGTATTGGGA GCTGAGCCAG CTGACCCACA
19751 ATATCACTGA GCTGGGCCCC TATGCCCTGG ACAACGACAG CCTCTTTGTC
19801 AATGGTGAGC AATTGTGATG TGGTTGGAGT TTCTTCTTCC TTGCTGAGCA
19851 GGCCTCTACT CTCTGTCTTG AGGTCACTCT CCCTGCCTGG CCACTGGTCT
19901 TGGCCATGTT GTCTGTATTT GATGATTGAT ATGAACTTCA CCGTTTCTTC
19951 TTCATCTTGT ACTGGAGACC TTCATCCTCA GGACCTTCTT CCCTGATCTG
20001 AGTGTACTTG TATAGAATCC TCAAAGCCCA TGTTCCCTGA AACTCCTTCA
20051 ATTGCACCAT GGTAGCACTG ACCCCTTTTG GTCCCCCACC TTNNNNNNNN
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R4
20101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN TTCACTCATC
20151 GGAGCTCTGT GTCCACCACC AGCACTN NNN NNNNNNNNNN NNNNNNNNNN
20201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
20251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
20301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

Exon R5
20351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNN NCCTG GGACCCCCAC
20401 AGTGTATCTG GGAGCATCTA AGACTCCAGC CTCGATATTT GGCCCTTCAN
20451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNGACTCCA GCCTCGATAT
20501 TTGGCCCTTC AGGTAAGTAC CAGTCAATGG CACCTCTATT AGAGTATGCA
20551 TGATGAGTGT CAACATCTCT GTCCTTTTCA CTCAAATAAG ATTAAAAATC
20601 ATAGCAT^TT GTACGTGATG ATGAGTCACC CAACAAACTT CTTTGAGTAC
20651 CCACTCTCTG CCAGGCCCTA GAGATAAGGC AGGGAACACA AAAGAGGTAA
20701 AAATCTCTGC CCTCAGAGAG CTTCTTTTAT TTTGAGGATG ATGTGGGATA
20751 GTGGTGATGA TGATGTTGCT GGAGATGATT ACAATAATGA TGGTGATGCT
20801 TATGACCATG ATGTGATGAT GATGGTGATT ATGAAGATGA TGATGATGAT
20851 ATTGATGATG GTAGTGGTTT TGACAGTAAT GATGATGTGA TGATGATGAT
20901 GATAGTGGTG GTGGTGATTA TGGGAAGGAT GACAGTGGTG GTGGTGATGG
20951 TGGTGGTTGT GGTGGTGATT GACAATGTGG TGGTGATATT GACAATGAGG
21001 ATGATGATGA TAGTGGTGGT GGTTATGATG GTTAAGGATG ATGTGATGAT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

21051 GGTGTTGGTG ATCACGGTAC TAGTGGTGGT GATGTGGACC GTCATGGTTG
21101 TGGTTGTGGT GGTGATGGTG GTGATCATGA TGATAATGAG GATGATGGTG
21151 GTGATTGTCA TGATGGTAAG GATGAAACAG TGATGGTGTT GGTGACCATG
21201 TTCCTGGTGG TGATGGTGCA GGTGATGATG TGGATGATGA TGGTGATGGT
21251 GGTGGAGATG ATACSGGATTA TGAATATGGT TCGGGTCTCT GACTGGTGGT
21301 GGTGATGACA ATAATGAAAA TGATGGTCAC AGTGTTGGTG ATGATGATGG
21351 TGGTGATAAC AAAGGTAATA GATAGTGTCT AGTATTATGG AACACAGAAC
21401 ATCACCAAAG GTTATGCTCA GCATCTAACT ATTATTATTT AGCATGCTCT
21451 ATGAAAAACT TTGATCGTTA TAGTCAAGGG AGGCATGAAA ACCTTCTATT
21501 TTATCACTCT CTTTAAATCT GGTTGCATAT GTTTAGAAAT AAATCTATTA
21551 CAAACTCTTA AATGTTCTCT ACTTTTGAAC ATAGTGTTTA TTTCCCACCT

Exon R1
21601 CCACTAC AGC TGCCAGCCAT CTCCTGATAC TATTCACCCT CAACTTCACC
21651 ATCACTAACC TGCGGTATGA GGAGAACATG TGGCCTGGCT CCAGGAAGTT
21701 CAACACTACA GAGAGGGTCC TTCAGGGCCT GG TGAGAGCC CTGCCCACCT
21751 CACTCTGCCC TGCCCACCTT GTCTTGTTCC ACCTACGTCA CCCATTCCAA
21801 GGCATGGAAG AAGATCTCAC CCACCTCCCC TCACCTGAGA GATAGCCCCG
21851 CCCCCTGATT ACAGCCCCTT CCACCTTACA TCTTCCTCAC TTCTATGTCC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R2
21901 TCAGCCATCT TACTCACCTC CCTCTTCCTC CTCCACAG GC TAAGGCCCTT
21951 GTTCAAGAAC ACCAGTGTTG GCCCTCTGTA CTCTGGCTGC AGGCTGACCT
22001 TGCTCAGGT G AGAACTGAGA ATAACCAGTC TGGCTACCCC AAGTGTTCCC
22051 AGGCCCAAGG AGTTTCATCA GCTTTCTTCC TTCCCTCCCT ATGGAAGTCC
22101 TCAGCACAAG TGGAATTCAG GCGTTGGTGG CTCCAGGATG AACATATCTG
22151 CTGATCCTAC CACCTCCCCC ATCAATCGAG AGAATTTGCA GGGCCCATCA
22201 GCCAGATCAG GCTTCTACTT TGGTCATCCT TCTGAATTTC TTACTTCTCC
22251 CTACCTCCCT CTCCTTCAGG TGTTAAATTC TCTTCCAAGG TTTCTCTCAA
22301 GATAAACATC CCCCATCCAC TTGCTTTCAT CCCCAATTCC AGCTCTTAAT
22351 ATTTCTCAAG TCTGGGCTCT CCTGTCCCCA TACCATGAGA ATGCAATTTT
22401 ATAAAATTCT TGTATTCCTG ACTCTACTCA CATTCCCAGG CTGCCTGGAA
22451 GTTGGTGCAT TCAGAGAATC TTAGTATGGC TTCTCACCTG TCTACCAGGA
22501 TTCTCATTTC CTCTGTCCCC TTCCTGTCCT GCCCCCAGGA ATCTCAGGAT
22551 GCCTCCCCAT AGGCAATCTA TTTAATGTCA TCCCCCTTAT CTGCCCTCCC

Exon R3
22601 TAG GCCAGAG AAAGATGGGG AAGCCACCGG AGTGGATGCC ATCTGCACCC
22651 ACCGCCCTGA CCCCACAGGC CCTGGGCTGG ACAGAGAGCA GCTGTATTTG
22701 GAGCTGAGCC AGCTGACCCA CAGCATCACT GAGCTGGGCC CCTACACACT
22751 GGACAGGGAC AGTCTCTATG TCAATGGTG A GCGGCTGTGA TGTGGTTGGA
22801 GATTCTTCCT CTTTGCTGGA CAGCTTCTTA CTCTCTGACT TGAGGTCACA
22851 CTCCCTGACT GGCCATTGAC GTCTTGGCTA TGTTGTCTGT ATGTGATGAC
22901 TGATGTCTGA ACTTCATAGT TTCTTCATCT TGGACTGAGT TCATCCTCAG
22951 TACCTTCTTC CCTGATCTGA GGGTACTGAT AGAGAATCTT CAAAGGCCCC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



23001 TGTTCCTTGA AACTTCTTCC ATTCCACTAG GGTATCTGTG ACCCCTATTT 
23051 GATTCCCCAC CTCTCCCTTA ACCCTTACCC ACTCTCCTCC CTCCTTCTCT 



Exon R4

23101 GTGCAGG TTT CACCCATCGG AGCTCTGTAC CCT^CCACCAG CAG TGAGTAT

23151 TCAACCGATG CTCCAGTAGC CCCAATTATA CACCAAGCAG GGCAGGAGCT

23201 GTCCTGTCTT CCTATGCCCC TATGTCCTCT TCATAAAGGA AGGGGCTGGG

23251 AGGGCACAAG TTATTCCCTT TCCCTTCTGG CCAGCTCCAG AGAGAGACCC

23301 AGCTCAGGCC CGATATGCAG CAAGGCCTGT AAATAGTTTT ATTTGCTGAC

23351 CTTTCTGCCA TGAGAGGCTT GGATGCTTCC CCTGAAGAGG GTTTCTCTGT

23401 AGCTCTTGGG ACTACCACAG TGGACCTGGG AAACTCTGGG GATCCACCCC

23451 TTCTACTGGT CCCTTGAATA AGTACCAGCC AATGGCACCT CTGTTAGAGC

23501 ATGGCTGATG AGTGTAAACA TCTCTTCCAT TATTCAGTCA AATAAAGATG

23551 GAAATTCTTT ATAAATCTAG TGATGATGAG CCAACCAACA AACTTTATTG

23601 AGCATTGTGA CAAGCCCTGG GGCTCTGCCA AATCCTGGGG ATATGGCATG

23651 GATCATGAAA CAATTAATAA TCTCTCCTCT CAGAGAGCTA TTTTTATGAT

23701 GATACTGATG GTGGCAATGA TGATGATGTT GATGGTGATT ATGACCATGA

23751 TGACAATGGT GATGGTGGTG GTGATGATGG TAATGATGAT GATGGTGATG

23801 TTGGTAATGA TGGTGGTGAT TATGACAATA ATGATGGTGA TGGTGACAGG

23851 GATGGTGATG ATTATGATGG TGGTGGTGAT AACAAAGTTA ATGGATAATA

23901 TATGAACTTA TTGGCTACTG AATATGCACC AAAGTGCTAT GCTCAGTGTT

23951 TAACTAGTAC TATTTAATAT GATTTCTAAA AAAAATCTTG AATTATTATA

24001 GGCAGAAGAA TCATGGGAAC CTTTTATTTT GTCACTCACT TTAAGTCCTA

24051 TTGCATATTT TTTAAGTCAA TTGCAAACAC AGTTTCTCTG CTTTGAACAT








Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



24101 TGTGTTTATA TCCAGTCACC CCAATAGTGC ATAAACCTGC TGATTGGAGC 

24151 AACTGTGTCT TACTCCCTTG TGCTTCCCTA GTATCTGCTT CAGGACCTTG 

24201 TACATGGTAG ATCGACAGAT TTAGATCTAC AGGAAAATAT GGATTTTCCC 

24251 AGGGAAGGAA GGAATGAAGT ATGCTTTCTT ATAATGTATG GAAACTTTCC 

24301 TCTTCTGCCT TGGTTCAACT TTAGTGTCTG CCAGAGTTTA CACTGGAAAA 

24351 CTATATGGCA TCTGCTCCAC TCCCTCATCC ATGACAGACA TCATTAATTG 

24401 ATTGCAGCAT TCATGGCAGA CATCACCAAT TGATAATAGC ATTCATTTTC 

24451 TCTCAGTTCA AAACAGCTTC AGAATGGTTA CCAAAAAAAA AAAATTCAGT 

24501 CGCTACCAAT TCAATTGGAG CTGACTCAGG ATTATGGGAC AGAATTCAAG 

24551 AGAGTTAGGT TCCTTGATGA TGTGTAGTGG CTATTTGTTT TCCGGTCCAG 

24601 GCTAATNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN

24651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 

24701 CTTTGTGCGG CAAAGTTCAG GGGCCCCAAA AATTTCTGTG CCCCAATCAT 

24751 GGCGGACCTA GGTTTAGGCA CAAATTCCAG GTATTAAGTC CCTGGAGATG 

24801 TTATGGCTTT TGGTTTTCCT AGAAAGGCTC AGCTCAGGCT CAGCTTGGTC 

24851 ATGCTGATAT CCTTTCTTCC ACTTGGTCGA TTTGGCTGTT GATACTTATG 

24901 TATGCTTCAC GAAGTTTTTG TGCTGTGTTT TTCAGCTCCA TCGGTTGGTT 

24951 TATGTTCCTC TCTAAACTGG TTATTCTAGT TAGCAATTCC TTTAACCTTT 

25001 CATCAAGGTG CTTAGCTTTG CATTGCATTA GAACATGCTC CTTTAGCTCA 

25051 TCGTACTTTT TTATTGCCCA TCTTCTGAAG CCTACTTCTG TCAATTCATC 

25101 CATCTGATCC TCCATCCAGT TCTGCACCCT TAATGGAGAG ATGTTGCGGT 

25151 CATTTGGAGG AAAAGAGGCA CTCTGGCCTT TTGGGTTTTC AGCATTTTTT 

25201 TGTTGATTAT TTCCCATCTT CAGGAGTTTT AGTTTCAGGC TTTGAGGCTG 

25251 CTGATCCTTG GATGGGGTTT TTATGGGGGT CTTTTGGTTG TTGTTGTTGA 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

25301 TGATGATGAT GTTATTGTCA CTTTCTGCTT GTTTTTCTTT CAATAGTCAG
25351 GTCCCTCTTC TGTAGGGCTG CTGCAGTTTG CTAGGGGTTC ACTTCAGGCC
25401 CTATTGATCT GATTCGCTCC CATGTCTGGA GGTGTCACTC AAGGAGGCTT
25451 GGAGAGCAGC GAACATAGGT GCCTGCTTCT TCTGGGACCT CTGACCTCGA
25501 GGGACACCAA CCTGATGCCA GTAGGATCGC TCCTGTGTAG GGTGTCTGAC
25551 AACTATTGTT GGAGGGTTTC GCCCAGTTGA CTGGCATGGA GAGCAGGACC
25601 CATTTAATGA AGCACTTTGT CCCCTGGTGG AGAGGGGGTT CTTCACTGGG
25651 GGGAAACCAC ATGTCTGGGC TGCTTGGATT CCTCAGAACT ACCAGAGGAG
25701 AGGCTAAGTC TGCTGGTCCA CAGAGACTAC AGCCATCCCT CCCACTAGGG
25751 GCCCAAGCCC AGGGAGTCCA AATTCTGTCT CTGAGCCTCT GGCTGGAGTC
25801 TTTGGAGATC CTGCAAGGAA GCTCTGCCCA CTGAGGAAGG ATGGGTCAGG
25851 GTTAGCCCTG AAGAGGCACT CTGGCTGCAG ACTGCCACAG CCGGTGTGTT
25901 GGGCTGTGGG GACAAGTCTT GGGACCAAGC CGTCCAGCCT ACCCGGCTCT
25951 AGCAGGGGAA AAGTACAGCC TGGAGCTATT GAAAGGGGTG CCGCCCTTCC
26001 CCCGCCCAGG GAGCTTAGCG TGTTAGGCAG TTGTGAGTCC AGTGCTGGCT
26051 GTCGCCCCTT CCCCAAGGAA CAAAAAAGAC TTAGCAGGCA GCCGCAGCCA
26101 GTGCTGGTCG CCCCTCCCCC GGGGAGTTCC GTAGGCTTAG GGAGATTCCA
26151 GCTGTAAGAA TCTGCGTGTT CTGGGGTTGG GACACTAGGT CCCAGTGGCA
26201 TGGGTTCGCG AGTGAGATCT TCCAATCTGT GAGTTGCACA GTTCCGTGGA
26251 AAAAGCACAG TTTCCCCCTC TTGGGTAGCC CGCTCACTCA CCACCTCCCT
26301 TGGCTGGAAG GAGGGGGTTC CCCTTCCCCG TGTGTCTCTC AGGTGGGCCA
26351 CCACACCACA CTGCTCTTCC TTCTCTCTGT GGGTCACTGC CAGCCTTCTA
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



26401 GTCAATTTTG ATGAGGGAAC CTGGACATTT TGGTTGCC2VG GAAGGATCAC 

26451 ACACTTATTA CAGTTTTTTT CAATGTGAGC CTCTGAGCGC TGCTGCTTAT 

26501 AGTCGACCAT CTTGGCCCCC AGAGTCACAC ATCTGTTATT TTTTGATGTT 

26551 TTGATTGTGG CAATTCTTGC AGAAGTAAGG TGGTATCACC TTATGGTTTT 

26601 GATTTCCCTG GTCATTAGTG ATGTTGAACA TTTTTTTGAT ATGTTCATTA 

26651 GCCATTTGTA TATATTCTTT CAACAACTGT CTATTTATGT CCTTAGCCCA 

26701 CTTTTTGATG GGATTGTTTT TTTCTTGCCA ATTTGTTTGA GTTCGTTGTA 

26751 GATTCTAGAT ATTAGTCCTT TGTTGGATAT ATAGATTGTG AA6ATTTTCT

26801 CCCACTCTGT GGGTTGTCTG TTTACTCTAC TGACTGTGAA GGAAAAGTCA 

26851 ATTTCTTATA CGAATTTGTC TCACTCCTAC TTCCAAATGA GATCCTGGGG 

26901 TTTTTTTTTT CTGTTAATCC TTCACT^TAC TTCTCCCACT TTTTTGAACT 

26951 CATTTGTTTA TATTCTGTTG TCTGCTTCTC TTTTATAGGA ATGTGACTTC 

27001 TTATGGGCTT TCTCTATTAT ACCACATATG GGTTTTTGTT TTGTTTTGTT 

27051 TTGTTTTGTT TTGTTTTTGT CCTCGGATCC ATTCTCCAAC CTCCTCCAGC 

27101 CTTCCCGTGC TCTGTGGGAT AGACGTCTGA CTCATGAAAA CTACATTTCC 

27151 CAGGCTCCCA TGCTAACTAG CTTCCTGTTA GGTTCAGCCA ATAGGAGGCA 

27201 TTGGTGGGAC AATGGTGGGC GGGGCTATGG AAGGGCCAGA GTATTTCTGT 

27251 ACCCCGCCCC CCTGCTCCCC TTCCAATGTT CCTGGAGCGG TGTAGGACCA 

27301 ATACTGTATA TATGGAAGGA AGGCAAGGTG GATAGATTGG AAGGAAGAAG 

27351 TGACAGATGG AAAGAAGAAG TGATAAATGG CAAGCGAGGC AAGGGAGCAG 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

27401 AGGATGGATG AGTGGATTGC AAGAAAGAAA AAAATGGATG AAATATAAAA
27451 GGAGCAGGAG AGATGGATAA GTAGATGQAA GTAAGAAAAG ACTGGTGTAA
27501 GAAAGGAACG ATTGATGATG GATGATGAAT GGATCAGTGG TQATTGGGTG
27551 AAGGGATGAA TGGATGGATG GACAGATGGA TGAACAGATG GGTGGGTGGA
27601 TAGATGGATG GATGGATAAA ATGGGTAGGT GGATGGATGG ATGGATGGAC
27651 AGATGGGTGG GTAGGTGGAT GGATGGATAG ATGGATGGAT AAGTGAATGG
27701 ATGGATGGAT GGATGGATGG ATAAATGGAT GGATGGGTGA AAGGAAGGAA
27751 AGAAGTGAGA GAAGGAAGAG GAAGGATAGA CAGATGTTAG AAGGTACAAA
27801 TGAAAGGAAG GAAGCCAGCA AGAAAGAAAG GATGCATTAA TAGAATGAAA
27851 GATGGAAGGG AAGAAGAAAG GATGGAAAGA GAGAAGGAAG AATGAACAGA
27901 AGGAAGTTCA AGAGTGGTGA AAAGAAGAAA GGCAGGGAGA GAAGGAGAAG
27951 TAAACTTTTC TTCTAGAGAT TTGTCTTAAA CCTTAGCTTG GCTGGACACT
28001 GTGGTTCACG CCTGTAATCC CAGCACTTTG GGAGGCCGAG GCGGGTGGAT
28051 CATGAGGTCA GGAQATCAAG ACCATCCTGG CTAACACGGT GAAACCCTGT
28101 CTCTACTAAA AATACAAAAA AAATTTAGTC AGGTGTGGTG GTGCATGCCT
28151 GTGGTCCCAG CTACTCAGGA GGCTGAGGCA GGAGAATGGC ATAAAACCTG
28201 GGAGGCAGAG CTTGCAGTGA GCCAAGATCA CACCACTGCA CTCTAGCCTG
28251 GGCGACAAAG TGAQACTCTG TCTCAAACAA AAACAAAACA AAAAAACAAA
28301 AACAAAAAAC AAAACCAAAC CAAAACAAAA AAAAAAACCT TAACTCATAC
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Genomic Repeats 
(SEQ ID NO: 2)

28351 TTTCATAAAG TTCCACACAC AGGGAGTGAT TAGAAAGCAT TTGCTGATAT
28401 ATTTTATATA ATAAACATGT ACACCATATT GACCTGTGTG CCCAGCAGTG
28451 CTTACATGAT TTACAATGAT TAACTTGTTT AAGCTTCATA ACAACGGTTG
28501 AGGCAGGAAA CATCATTGTG AACCATTGTC ATCTCATTTT ACAGATGAGT
28551 AAACTGAAGT GCTGAGAGGT TGGTTATGGC TGCAAAGATT GTTGGCCATG
28601 TTAACCAATG CATAGAAGAT TAGCATACCT GGTTGTGAGT GCAGGAGAGA
28651 GAGAGAAATG GGAGAAAGGC AGAGAAGGAT CGATGGGGAG AGAGGAAGAG
28701 AGAGAGAGAG AATAAATTTT TTAAAAATGT CTAGAGTCAT GACTTCCGCA
28751 TCAGTGTGGT AATATGCAGC CTTTACCCTG GGAAAGATCA GAACCATTGG
28801 TACTTTTTAC AGAATCTTCC CTTCCTGCAT TTGGGTAGAA GGACCCCATC
28851 TGGACATCCC AAATCATTAA GCACACCCTT ACTGGCTGCT GGAGTTGTCT
28901 CCATTAAAAG TCACCGTTGG GTTTATTAAG AGGCGGACAC AGGGTCCTTA
28951 GAACACACTG CCCCCACCTG TCCCACACCA CCCCCCACCC ACCCATCATC
Exon R1
29001 CTCCCCAAGA GCTTCATCTC TCTCTCTCTT CCCCCTGCCC TAGCCGGGGT
29051 GGTCAGCGAG GAGCCATTCA CACTGAACTT CACCATCAAC AACCTGCGCT
29101 ACATGGCGGA CATGGGCCAA CCCGGCTCCC TCAAGTTCAA CATCACAGAC
29151 AACGTCATGC AGCACCTGGT GAGAGGCCTG CCTCCCGCTG CAGCCCTGCC
29201 ATGCCCATCC TAGGGCTGTT GCCTGCCTGC CTCTGACCAA CCCAAGCTCC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

Exon R2
29251 CTTCTCCCTC TGCA GCTCAG TCCTTTGTTC CAGAGGAGCA GCCTGGGTGC
29301 ACGGTACACA GGCTGCAGGG TCATCGCACT AAGGT GAGAA ACTCCCCCAC
29351 CCACAGCGCA CCACCAAGAA CTTAGAGTTC TGACTGGGAG GTCCCTCTTG
29401 GGTTGGGGTG GGCTACATAT TTTTTTAAAT CTTTTTATCT TTCCTTTTTT
29451 TTTTTTTGAG ATGAAGTTTC GCTCTCGTTG CCCAGGCTAG AGTGCAATGG
29501 CACGATCTTG GCTCACTGCA ACCTCTGCCT CCCGGGTTCA AGTGATTATC
29551 CTGCCTCAGC CTCCCCAGTA GCTGGGATTA CAGGCAGGCA CCACCATGCC
29601 TGGCTAATTG TTTTGTATTT TTAGTAGAGA TGGGGTGTCT CCATGTTGAT
29651 CAGGCTGGTC TTGAACTCCT GACTTCAGGT GATCCACCCT CCTCAGCCTC
29701 CCAAAGTGCT GGGATTACAG GCGTGAGCCA CCATATCTGG CCCCATTCTT
29751 TTTTTTTAAA TGAATTTAAG GAGTGCAAAT GCAGTTTTTG TTACATGCAT
29801 ATATTCCATA GTGAAGTCTG CAGACAGTAG ACTTCCAGAC AGTAGCTTCT
29851 GGTGTATCAC CCGAATAGTG TACATTGTAC TTATTAAGTG AGGTTCCCCA
29901 CCCTTCTCCC ACTCTCCCAC CTTTCTGAGT ATCCAGTGTC TATTATTCCA
29951 CACTCCAGGT CCATGCTCTC ACGTATAAGT GAGAAC6TAT GGTATTCCAC
30001 CATGAGCTAA TGGACATGGA GTCCATTGGC TCCCACTTAT AAGTGAGAGC
30051 ATGCGGTATT TGACTATTTC TGAGTTTCAC TTAAGATAAT GGACTCCCAT
30101 TCCATCCATG TTGCTGCAAA ATACATGATT TCACTCTTTT TATGGCTGAA
30151 TAGTATTTCG TGGTATATAT ATATACCACA TTTTCTTTAT CCAGTCTTCT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

30201 ACTGATGGAC ACTTAGGTTG GGTCCATACC TTTGCTGTTG AAATAGTGCT
30251 6CAATAAACA TACACGTGCA GGTGTCTTTC TTATATAAAT GATTTCTTTT
30301 TTTCTTTCCT TTTTTTTGAT ATAACGAATT TCTTTTATTT GGGTTAAATC
30351 CCCCAATAGT GGGATTGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
30401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
30451 TGACCTGTCC GTATTGATAT ATAAAATGCT GCATTTAAAG TGTACAACTT
30501 GATATTTTGG TATACATTGT TAAATCATGG CCACATTTCA GCTAATTAAT
30551 ATATCTATTA TCTCTACATA GTTATCATGT TTGTACCCTT TGACCAGCAT
30601 CACCCCATTT GCTCCTCCTC CCAGCCCCTG GCAACCACCA TCCTACTCTC
30651 TGCTTCTATG AGTCTGACAA TTTTAGATTC CACCTATAAG TTAGATTATG
30701 CGGTATTTGT CTTTCTGTGC CTGGCTTATT TCACTTAGCC TAATGTCCTC
30751 CAGCTCCATC TATGTTATCC CAAGTGGCAG GATTTTCATC TTTCTTATAT
30801 ATTTCATTGT ATATGTGTAT GCCACATTTT CTTTACCCAT TCATCCATTG
30851 AAGGTCATTT AGCTTGTTTC CATATCTTGG CTATTTTGAA TAGTGCTGCA
30901 ATGAACATAG GAGTGCAGAT ATCTCTTTAA GATACTGGTT TCATTTCTTT
30951 CTTTCTTCTC            TTCTGAGACA GAGTCTGACT CTGTCGCTCA
31001 AGCTGGAGTA CAGTGGTGCA ATCTTGGCTC ACTGCAAACT CTGCCTCCTG
31051 AGTTCAAGCG ATTCTCGTGC CTCAACCTCC CAGGGAGTTT TGCTCTTGCT
31101 GCCCAGGCTG AAGTGCAGTG GTGCAATCTT CACTCACCAC AACCTGTGCC
31151 TCCCGGGTTC AAGCGATTCT CGTGCCTCAG CCTCCCAGGT AGCAAGGATT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

31201 ACAGGCGCCC AACACCACAC CAGGCTAAAT TTTTTTGCAT TTTTAGTAGA
31251 GACGGGGTTT TGCCATGTTG GCCAGGCTGG TCTCAAATTC CTGGCCTCAA
31301 GTGATCCACC TGCCTCAGCC TCCTGAAGTG CTGGGATTTT ACAGGCATGA
31351 ACCACCACAC ATGGCCTCAT TTCTTTTAGA TATATATGGG TTGAGCTATT
31401 CTCAGAGGGT CCTTTTCTGC ATCTATTTAA GATCACATTT TTTTTATATT
31451 GTGGCAAAAA TACATGTAAC ATAAAATCTG CCATTTTAAC CATTTTTAAA
31501 TGTACAATTC AGTGACATTG ATTATATTCA CAATGTCATA CAGCCATCAC
31551 CACTATTTAT TTCTAATACT TTTCCATTGG GTAGATCCCC AACAGTGGGA
31601 TTGCTGGGTC AAATGGTAGT TCTGATTTTT TTTTTTTGTT TTTTGAGAAA
31651 TCTCCATACT GTTTTTCATT TGAGGTTGTA CTAATTTACA TTCCCACCAA
31701 CAGTGTATAA GAGTTTCCTA GGCCGGGCAT GGTGGCTTAT GCCTGTAATC
31751 CCAGCACTTT GCGAGGCCCA GGTGGGTGGA TCATGAGGTC AGGAGATCGA
31801 GACCACCCTG GCTAACATGG TGAAACCCCG TCTCTACTAA AAATGCAAAA
31851 AATTAGCCGG GCGTGGTGGC GGGTGCCTGT AGTCCCAGCT ACTGGAGAGG
31901 CTGAGGCAGG AGAATGGCAT GAACCCTGAA GGCGGGGCTT GCAGTGAGCT
31951 GAGATCGCAC CACTGCACAC TTCAACCTAG GCGACAGAGC GAGACTCCAT
32001 CTCAAAAT^ AAAAAAAAAA AAAAGGTTTC CTTTCAGTGC ATCCTTGCCA
32051 ACTTGAGTTT TCTGGGTTGG TTTGCACTCT CATGGTATTT ACTAGATACT
32101 TCTCCATTTA TATTTTTACT CAACCCATGC CCATAACACC ACTCCTCTAC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2) 



32151 CATTCCCACC AACCATGTAT AAGAGTTCCT TTTCTTGCAT CCTTGCCAAC 

32201 TTGACTTCTT TGGGTCAGTT TGCACTCTCT TGGTATTTAC TATTTACTTC 

32251 TCCATTTATA TTTTTAGTCA ACTGATGCCC ATGGCACCGC TCCTCTGAGG 

32301 CAGGTGCTGG 6TACTAGAGT GATAAGACAG ATGCTGTCCC TGCCCTCACC 

32351 CAGTGGAGAA GAACAGATGC TAAACAGGAA CATAAATATC TAAGTAAAAT 

32401 GGCTTCAAAT GGAGTAAAGT GATATGAAAC ATAAATAAAT AGCAAGTGAT 

32451 GGGTAGAGCA ACTTTACCCA GGATGAATCT TGGGCTGTGT CCCaAATGGC 

32501 CATGAAAACT GTTCCAGGCA GGGAGAACAG CATGAGAAAA GGTCTTGAG6 

32551 TGCAAATGAG CTTGGCATGT TCTATGAACA GCAAAGAGGC CAGTGTGGCT 

32601 GGAGCAGAGA GAGAGCAAGA A6AAAAQAGA GAAAGGATGA GACTCAAGAC 

32651 ATCAGCAAGT TTGAAGGGCC TTGGAGGACT TGGATTTTTT TTTTTAAGAC 

32701 AGCTTTGTTC TTGTTGCCCA GGCATGATCT CGGCTCACCA CAACCTCCGC 

32751 CTCCTGGGTT CAAACGATTC CTCTGCCTCA GCCTCCCGAG TAGCTGGGGG 

32801 TAACAGGCAT GTGCCCACCA CACCTGGCTA ATTTTGTATT TTTAGTAGAA 

32851 ATGGGGCTTC TCCATGGTTG GTCAGGCTGG TCTCGAACTC CCGACCTCAG 

32901 GTGATCCX3AC C6CCTCGGCC TCCCAAAGTG CTGGGATTAT AGGT6TGAGC 

32951 CACTGCACCT GGCTTGGATT TTTTTTGTTC TATATTGTGG TAACATACAC 

33001 ATCACATTAA ATTGATCATT TTAGCTATAT TTCCCGTTCA GTGGCATCAA 

33051 GCACATTCAC ATTATTGTGC AACCATCACC ACTATCATCC ATCTCCAGAA 

33101 CTTTCTCATC TTCCCAAACT GAAACTCCAT CCCCATGAAA CACTCATTCC 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

33151 TCATCCCCCT CCTCAAGCCT CTGGCACCCA CCATTCTACT TTCTGTCTCT
33201 GTGAATCTGA TGATTCTGAG GACCTCCTAT GAATGGAGGA ATCATATGGT
33251 ATATGTCCTG GTTTATACTG TATGGCTGGC TTATTTCACC AAGCATAATG
33301 TCCTCAAAGT TCATCCATGT TGTAGCATGT GTCAGAATTC CCTTCCTTTT
33351 CCACTTGTAT GTAAATGCTG TATTGTGTTT CTCCATTCAT TAGGACTTTG
33401 ATTTTTGCAG GGAGTTGTCA AGGGGTGCTG GGTTCTGGGG CTTCAATATA
33451 ATAAGAGTAA GCTAAACTGG TTCATTTCCT CCTTCGTGGA GACCATGTTC
33501 TGGTAGGAAC AGGAACAAAT AATTTATGAT TACATAGAGG GTGACCAGGG
33551 CAGTGACAGG GGAAGAGTGG AGGATTGTGG GACCCAGAGG AGGCTCCTGA
33601 CCTTGCCTAG GAAGATAGGA GGAGGAAGAG GAGGAGQAAG AGGAGGAGGA
33651 AGAGGAGGAG GAGGAGGAGG AGGGAGTCCT CTAAGCTGAG ACCTGGAGGA
33701 TGACCAGQAA GTTATCCAGG TAAGGAGAAA TGGGGAGAAG CTTCCAGACA
33751 AAAGTAACAG CAATTGCAAA GATCCTGA6A TGATAGATAA GGTCAGGTGG
33801 AGAAAGTGCA AACTGTCAAT GAGACCAAAA TATGGACTGT GAGTTGTGCA
33851 GTGACCACAA GTGGAGAGGT GCTAGGTGGC CTTCATCCCC CAAAGCTGCA
Exon R3
33901 CCTCTCCCTC CTCAG GTCTG TGAAGAACGG TGCTGAGACA CGGGTGGACC
33951 TCCTCTGCAC CTACCTGCAG CCCCTCAGCG GCCCAGGTCT GCCTATCAAG
34001 CAGGTGTTCC ATGAGCTGAG CCAGCAGACC CATGGCATCA CCCGGCTGGG
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Table 2 (continued) 

Genomic Repeats 
(SEQ ID NO: 2)

34051 CCCCTACTCT CTGGACAAAG ACAGCCTCTA CCTTAACGGT GAGCAGCTAT 

34101 CAGCCCCATC TCCCTGCCCC ACCCCCCAGC CCCCACTGCA GTCCAGGAGG 

34151 GTGTCTGTTT GCCGGTTCTC TAGGGAAAGA CTTGGGGTTC AAGTCTTGGC 

34201 ATTACCACTG GCCCTCCCAT AACCACAATG CAAGGTTGGA CTTTGATTAA 

34251 TCCCATTTTA CAQATGAAGA AACTGAGGCT TAGACAGGCT AAGCAATTTA 

34301 CCTTGACAGT GGTGGAACCA GGATATGAAC TCCACTTGTC AGCATTCGGT 

34351 GCTATGATCC ACTCCACATG TTTAACTCAC AGAAGAGTCT TCCTGGTGGG 

34401 GGCACTTGGG GGACAAAAAA CACATTTCCG GCTGTGAGCA GTGGCTCACA 

34451 CCTGTAATCC CAGCACTTTG GGAGGCCAAG GCGGGCGGAT CACAAGGTCA 

34501 AGAGATTGAG ACCACCCTGG ACAACATAGT GAAACCCTGT CTCTACTAAA 

34551 AATACAAAAA TTAGCTGGGT GTGGTGGCGC ACGCCTGTAG TCCCACCTAC 

34601 TCGGGAGGCT GAGGCAGGAG AATCGCTTGA ACTCGGGAGG CAGAGGTTGC 

34651 AGTGAGCCAA GATTGCGCCA TTGCACTCCA GTCTGGGTGA CAAGAGTQAA 

34701 ACTCTGTCTC AAAAAAAAAA AAAACAATTT CCCCTCCCTG CTTTCTTCTC 

34751 ACCATTGACG AGGGATGGGC TTCTCTCCTA CCTGAGGCCC CCTATACCAG 

34801 GAAGATCTAT GGGATCTAAT CTTCAGCX3CA CACTGGGCCT CAGCATTGGT 

34851 CTAGAACTCA GGATAAGATA GCATTTAAGA AGGCATCCCC TAAATGGGGT 

34901 TCTGAGAGGC AAAGCATGAC CGTGGAGAAT TGACAAAATA GCTCGCCTTT 

34951 CATCCCCTCC ACCX3CCAACC CAAGAACAGT GCTTATCATC ATGACCCCAT 

35001 GAGGTGGGCA CCCCATATCA CTTATATGAG GTACCTTTAG GTAGGTACCG 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

35051 GGATGTGGAG AGACATCCTG GGCTTTCATT ACTCTTATTT TAGCAAAGAG
35101 GGAATCTGAG GCACAGAGAA GGGAAGGGAC TTGCCCATGC CCACAGCGAG
35151 TTTTTGGCTA GTATGGGTCT TGATGTTCTT TCTGGGTCCG Tmmimmiil
35201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
35251 NNNNNNNNNT TCCTGCGTGG GAGATGTGTG GATTTGATTT GTATCTGGAA
35301 AGATGATTTT TTATTGGTGA CAAAGCAGTT AAAGTTAATC TTCACAGTTG
35351 TGCGGAGAGT GACCACGCGA GTTAGTCTTA TCCTTATTTT TTTGATCATC
35401 CCGCTACACA AGACAAAGCX3 AACCGCACAG GCAACATCAG CAGGCCCCAT
35451 TGGTGTGTTC CCCTCTATGG GTCCATGTGT TCTCATCATT AGCTCCCACG
35501 TATAAAGTGA GAAGATGGCA GTATTGGTTT TCTGTTCCTG CATTAGTTTG
35551 GTAAGGATAA TGACCTCCAG CTCCAACCAT GTTCCTGCAA CGGACATGAT
35601 CTCATTCTTT TTTATAGCTG GATAGTATTC CATGGTGTAT ATGTTCCTCA
35651 TGTTCTTTAT CCAGTCTATC CTTGATGGGC ATTTAAGTAG ATTCCATGTC
35701 TTTGCTATTG TGAATAGTGC TTCAGTGAAC AGGTGTCTTT ATGATAGAAA
35751 AATTTATATG CCTTTGGGCA TATATGCAGT GATGAGATTG CTGGGTCAGA
35801 CGGTAGTTCT GTTTTTAGCT CTTTGAGGAA TCATCCTGCT GCTTTCTACA
35851 GTGGATGAAC TAATTTACAC TCCCACCAAC AGTGTATAAA CACTCCTTTT
35901 TATCTGCAAC CTCAGCAGCA TGGTTTTATT TCTCTTTATG GCTGAATAGT
35951 GTTCCATTGT GCATATATAC CACACTTTCT TTATGGATTC ATCTGCTGAT
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

36001 GGACATATAG GTTGATTCCA CATCTCTGCT ATTGTGAATA GT6CTGTGAT
36051 AAACACACAG GTGCGGGTTG GGTCTTGATG ATCTCAGTTA ACATCCAGTC
36101 CCTTCAACTT GGCTATTGCA GGGAGCTGTT CCCCCTTGTA AACTGCACAG
36151 CTTATGTGCT TCATTTTGTT CCTTCATTTA GATTTACCAA GCAGCTACTA
36201 TTAACCAGGC CACAATGTGC CTCGCCCCCA GGAACAGAGA TAGGTTACAT
36251 GTGCATCCTG TCCTAATGTA ATCTCCAGGG GGGCGGAGAC TGTTTTGTTC
36301 TACCCTATAT TCCCCAAATG TAAAGGGAGC CTTGCACATA CTAAGCCCTT
36351 AATAAACATT CATTGGGTGG AGGAATAGAT TGGAGGAGGC CTGGAAGGGG
36401 AGGCGGGGGT TATGGATGGA TAGGAGGATA GACTTGTGAA CACAAAGGTA
36451 GTGAGAGCCT CTCATTGGAG GCATGCTGGA GACGTGAGTA GGGAAGGGTC
36501 AGTGCTAATT GAAATATCAG GAAATTCTTT CTAGTGGTGA ACACATTTAA
36551 GTCAAATATT AGATGATACA TAAATGTATC CATAATCTCT AGATACACAA
36601 AGGGAAAGGC ATCCAGGCAG GGGCCCCATA TGGACAAAGG CATGGAGTAT
36651 CTGGGACGGT TCCACCACCT CCTCTTACGT GTGACTTCTT TGTTTCAAGG
Exon R4
36701 TTACAATGAA CCTGGTCTAG ATGAGCCTCC TACAA GTACG TGTCTTTGAA
36751 TCTAGTGCCC ATTTCAATCT CCATGGGTCT TGGTTCAAGC TTTTCTCCTC
36801 ATTCATGAAG GAAGGTTGCC CGAAATTCGG GCTGGTCCCC TAGGTGGTGA
36851 GGGGCATTGT CTCAGTGGGA GGAAGAATGC TGAGTCCTTG GCCCTGTTTT
36901 TAGACCTGCA GCCATAGTCT TGGCTTTGTG AATTTTCCAT GTCCCTCTGG
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2) 



36951 GTTGGAGGAA GAAGTTTGAA CAAGCATTCC CTACACGGGA TAGAGGTTGA
37001 GGTCAGATGA TGACCTCTGT TAGTCTGTAC CCTCCTTGAT AAGAAAATCT
37051 CCTCCAAGTG CCCCAGCAGA GGCTTCATGG TCAAGCTGCA GACTCTGCTG
37101 GCTACTGGTT TTGGCTAl^T TTGCCCATTG CCTCATCCAG TGATCCACTC
37151 GTCTATCTTT CCAGCCATCC ATTTTTCTAT CCTTCCAGTC ATCTCTCAGA
37201 CACCACCTGT CCTTCCATCC ATCCATCCGT CCATCCATTT ACCCATCCAT
37251 CCATCCACCC CATTTTCCTG ACCATTTACC TCCTCGTCCT TCCTTCCATC
37301 TGTCCTTTTA TCCATCTATT CATCCATCAC CCATCCTCCT GCCCATTCAC
37351 CTGCTTGTCC CTCCTTTCTT CTGTCCTTCT ATACATCCAT CCATCCATCC
37401 ATCCATCCAT CCAGCCATCC ACTCATCCAC CACCCACCCA TCCTTCTGCC
37451 CACTCACTCG CTAGCCCCTC CTTCCTTCTG TCCTTCCATC CATCCATCCA
37501 CCCATCTTCC TGCCCATTCA CCTGCTTGTC CTTCCTTCTA TCTGTCTTTT
37551 ATCCATCTCT CCATCCATTC TCACCATCCA TCCATCCATC CTTCTCCCTA
37601 TTCACTGGTT TGTCTTTCCT TCTGTCCTTC CAACCATCCA CCCATCTCTC
37651 CATTCATTCT CCTCTTCATT CACCATGTTT CCTTATTTCT GTCTCTTCCA
37701 TCCATCCATC TATCCAGACA GACATCTCCT CCCCCCATTC TCCTCCCCAT
37751 TCACTCAATT GTCCTTCCTT CCATCTGTCC TTTTATCCAT CCATCCACCC
37801 ATCCATCCAT CCATCTATCC TTCTCCCCAT TCACCTGTTT GTCCTTCTTT
37851 CTGTCCTTCC AACCATCCAT CCATCCATCA TCCATCCATC TATCCTTTTC
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



37901 CCCATTCACC TGTTTTGTCC TTCCTTCTGT CCTTCCAACA TCCCTCCATC

37951 TCTCCy^TCCA TCCTCCTGCC TATTCATCTG CTTGTCTTTC CTTCCTTCTG 

38001 TCCTTCCATC CATTCATCCA TCTGCCCATC CACCCACTCA TCCTCTTGCC 

38051 CATTCACCTG CTT6TCCTTC CTTCCACCTG TCCTTTTATC CATCCATCCA 

38101 TCCATCCATC TTGCTCACTC CTCCACTCAC ACAATCACTC CTTCCCTCAG 

38151 TCTCATTTAT GGCCCACCTG TGAATGGTTG TCCTGGCTTG GACCACTGAT 

38201 GAAGCCCAGG GGAGCTTCTC CCACTAGTGG TGGGCTTTTG TCCTCTCTGA 



38251 TGGACTGTTC CTTCCACA GC TCCCAAGCCA GCCACCACAT TCCTQCCTCC 

38301 TCTGTCAGAA GCCACAACAG GTATTTGGGG CCATTTTTCC TCCTCGAAGA 

38351 TTAGAATAGC ATTTCAATCA GACACCTGCC CTCGTGGAGT CCCAGATTTT 

38401 ATGAAATAAA TAGACCATCA TAATGTCAGA TGTTTTGGGG TGAGATACCT 

38451 GGCSATAGTTG GGAAGGAGGA GGGCTTTCTG GAQAAAGTTT CACCTGAACT 

38501 GAGTCTTTAA GGATGACTAA GAGTGATTCA GGCATiATAGG GCATGAATAG 

38551 TATAACTGAA AGAGGGGAAT CTGTGAGCAA AGCCTCAGTG GCCAGAAACA 

38601 GCATAGAGTA TAGGGAQAAG TQAGAGAAAT TTGGTTTGCA TQAAACATAA 

38651 AGCTTAACCC AGAGTGGATG GATAAGTGAG ACTGAAAGGT CAGCAGGAGC 

38701 CAGATTGGGA AGGGCCTTGA ATGCCAAGTC AAGAAATTTG AACTTAACAC 

38751 TGAAGGCCAT AGGGAGCTGT GGATGGTACT AGAGCAGGGG CAGCCATAGT 

38801 GA6ATTGTCA TTTCAGAAAQ ATTCTTCTTG TGTTCAGTAT AGAGAATGTC 



Exon R5 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



38851 CTTTAGACAG GGCATCCAGT GAGTCTGCCA GGTGCTAATC AGGGTGAGAG 

38901 AAAATAAGAC CTGAACTGGG ATAGGGGGAG GAGAGAGAGG ATATATGTGA 

38951 TGAATATTCA GTAAAGAGAA TTGGTGTTAC TTGGAGGGGA GAAGACACAT 

39001 AGCTTCTGAC TTGCGATGGC CACACTCAGT TTAATAATGA GCGCAGTCTG 

39051 ATCTAGTCTC AGACCAGCCC TCAGTTGCAG ACGTCTCTCC TCCCCTCCTG 



39101 CA GCAT6GGG TACCACCTGA AGACCCTCAC ACTCAACTTC ACCATCTCCA

39151 ATCTCCAGTA TTCACCAGAT ATGGGCAAGG GCTCAGCTAC ATTCAACTCC 

39201 ACCGAGGGGG TCCTTCAGCA CCTG GTGAfiA CCCTGGTCCC AGCAGCTCCT 

39251 GGTGGGATAA ATCCTACCCC CAACCTCTGT TCCTCGGCTT ACCCTCTTCC 



39301 TCCTTCCTCT CAAG CTCAGA CCCTTGTTCC AGAAGAGCAG CATGGGCCCC 

39351 TTCTACTTG G GTTGCCAACT GATCTCCCTC AGG TGAGACC ACTTCCTGGC

39401 CATTTGCCAG TAACAACCAC CCCTTTTGTG ACCACCCCTT CCTCAGCTTT 

39451 CCCCTGCTCC TCCCTCCACT GCTCTTTACC TGCAGAGGTC TCGGGACCTC 



39501 TCTAGAGTCC TCAAATGCCT CTCTCCCCAG GCCTGA6AAG GATGGGGCAG
39551 CCACTGGTGT GGACACCACC TGCACCTACC ACCCTGACCC TGTGGGCCCC
39601 GGGCTGGACA TACAGCAGCT TTACTGGGAG CTGAGTCAGC TGACCCATGG
39651 TGTCACCCAA CTGGGCTTCT ATGTCCTGGA CAGGGATAGC CTCTTCATCA

Exon R1

Exon R2

Exon R3
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)



39701 ATGGTG AGTG TCAGGCTGAA CTTGGATTTA CAGTGACTTT TGGGGAGTTG 

39751 GTTTCTTTGT TTTTGAGATG GAGTCTCACT qTATCACCCA GGCTGGAGTG 

39801 CAATGGTGCA ATCTTGGCTC TGCAACAGTG ATTCTCCTGC CTCAGCCTCC 

39851 CAAGTAGCTG GGATTTACAG GTGCATGCCA CCACGCTCAG CTAATTTTTG 

39901 TATTTTTAGT AGAGATGGGG TTTCACCATG TTGCCCAGGC TGGTCTCGAA 

39951 CTCCTGACCT CAGGTGATCC ACCTGCCTTG GCCTCCCAAA GTGCCAGGAT 

40001 TACAGGCATG AGCCACCATG CCCX3GCCCAC CATGACTATT ATTTGTCCCT 

40051 GTTGTATGCC CTTTCCTCTC TAAAATAAAT AGCCCAAGGC CTGGCTGGGG

40101 GACACCCTTC CCCAAACCAC CAAGGGGAGG GTCTTTCCCA TTATTTTGAG 

40151 TAAATAGCAT GAAATTCTTT GACCAAATTA ATGTCATAAA TTGTTTGTCT 

40201 CTTTCTCCTT CACTTTTGTT TCCAACTTGG TTGCGGTATA ACTATCAAAT 

40251 ACAATTGTAT GTATTTAAGA TGTATAATGC AGTGATTTAA TATATGTGTA 

40301 GCTTATGAAA TGATTACCAT GATCAAATTA GTTAACACTG CTTTCATGTC 

40351 ACATAGTTAC CGTGTGTCTG TGTGCGTCTG l^XGAGTrAG AGAGAAAGAG 

40401 AACATTTAAG GTCTACCCTC ATAGAAAATT TCAGGTTTAC AATACAGTAT 

40451 TATTAACTAT AATCATCAAG CTTTATACTC GATCCCCAGA ACTTATTCAT 

40501 CTTGTAACTA AAAGTTTGTA TTTTGTGACC AACATCTCCC CATTTTCTCT 

40551 ATCACCACCC CCATGCCCCC AGCCCCTGAT AACCATCATG CTACTCTCTG 

40601 CTTCTGTAAG TTTGACTTCT GATCCCACAT ATAAGTGAGA TCATGCAGTA 

40651 TTTGTTTCTC TCTATCTGGT ATATTTCACT TAGCATAATG AACCCCCCCC 
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Genomic Repeats 
(SEQ ID NO: 2)

40701 AGGTACATCC ATAATGAATT TCAATTCAAA ACCCAAGTGG CTGAGTCGTG

40751 GCATCCTTTG GGACAGGATA GCAGGTCCCT TCTATATAAG GATCCTCTGT 

40801 GTCAGTGGTT ATTACCAGGG GACAATTCTG CACTTCTGCC CCACCCCACC 

40851 CCCCAACTGG GAGACTCTAG GCAATATCCG AAATCATTTT TGGGTATCAC 

40901 AACTCAGGGA GGGAAGGAGG GTGCAACTGG CACCTAGTGG GTCGGTAGCC 

40951 CATTTTCCAG TGCACAGGAG ACAACCACCC CAGGGAATGA TCCAGCCCCA 

41001 AATGCCAATA ATTTCAAGGG TGAGAAATCC TGTTGTACAT GGTCTCAAAG 

41051 TTCTTAGGTG GGCACAAGGC TGACATTTAT CACACTTTAC TGTAATTACT 

41101 TGTTAAATTT ATCTGATTCC CCCTTACCCT GTGAACTCAA CAAA^TTACG 

41151 GTCTATTATG AGTGCCACTG TACCCTCGGT TCGCAGTACA TCAGCACATC 

41201 ATAGTATGGA AAGAATCATT GAATGAGTGA GCAAATTAAA GATTTGTGTC 

41251 TCTGCTGTAA CTCACATTCA TTAATTCATT CATTCAGCAA ACATATATGG 

41301 GTGGCTGTTC TGCCCCAAGC CTTGTACTGG GTCTGGAGAT AGAAGACACA 

41351 TTTTTCTGTC TCTGAAAAAC TCATACTCAA GTTAACAACA AATTACGGGC 

41401 ACAACAAAGA CCCCACTGCT GTTATTAACA GGGTACTATG GGAGCTGAGA 

41451 GGAGGAGTAA ATTAAGGAGG GCTTCCTGGA GGAGGGTGTT ATATACCCGG 

41501 CCCTGTGCCG GGACACATAA TGATAAGACA GACTTGGGCC TCTGCTGTCC 

41551 TGGAGCTCCC TCTCACTGGG CTCTTGAAGC GTGAGCAGGA GTTTTGCAGG 

41601 AAATGAAAAG GATGCATTCC TAGAAGTGGG AACTGCATAG CACATGCAGG 
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Table 2 (continued) 



Genomic Repeats 
(SEQ ID NO: 2)

41651 AAAGCTCAGC TCAGAAGAAT CTGTGTAATA TTCCATTTTT CCCTCTCTTT
41701 GGGGCAACTT TCTGTCTAAG AGCTCCTGCA ATGCCCAGCX3 TGTGGACCTG
41751 AAATTGATTC TGACAGTAGG CAGGGGACTG CTGGGCAACT TTGGCTCTGC
41801 ATTTTGTGAT CAACATTTCC CCACCATATG TTGCCTTTTC TTCTTCTCTG
Exon R4
41851 TGGCTCCAGG CTATGCACCC CAGAATTTAT CAATCCQQGG CGAGTACCAG
41901 ATAAATTTCC ACATTGTCAA CTGGAACCTC AGTAATCCAG ACCCCACATC
41951 CTC7VGAGTAC ATCACCCTGC TGAGGGACAT CCAGGACAAG GTG GGGCATC
42001 TCTCACCCCT CCCGTCTTCT CTGTCCTGTG TGCTTCTCTC CCTCTTCTAC
42051 CTGATTTCTC TGTTAAGTGA TCACTTTAAA TGCTTCACTT CACTATGTAT
42101 TCTGGGTTCT CTCTCAGTTT CCAAAAGTAC TCTCTTGACT ACCATTCCCA
42151 TTTCACAGAT GGGCAAACTG AGGCTCAGAA AGGGGCGTGG TGTGCCTAGG
42201 GTCATACAGT GCTTTAGGAA CAGAGTTAGG ATTTGAACTC TGGTCCCCTT
42251 TGCTCCAAGT CCTGTGTTTT TTTCCACTGG CATCAGCGGC CCCTCCACCC
42301 CCAAGAGGCC TCCATCTCAC CCACTCTCCC TACCCATCTT TCTAGGTC
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Table 3



Genomic Carboxy Terminal
(SEQ ID NO: 3)

Exon C1
1 ACCACACTCT ACAAAGGCAG TCAACTACAT GACACATTCC GCTTCTGCCT
51 GQTCACCAAC TTGACGT AAG TTCTGAAOGT CATAAGCAGT GACCAAGCTT
101 GTGGCTGTGT CTCTGAGCAC CCTTGAGCTA GACGTCCCCA GTGGGGTACC
151 CATTCTCCCC TACATCCCTG TCTAGCTAAT CCTACCATCT CCTCCCATAA
201 ATCCTCAAGG TAGGGAGTGA GGATTAACCT CATGGGGCCA CCAACTCCCA
251 GCATACACCT TCTTTTTTTT CTGGACACTT GGGAAAATAT AACTTTTTGA
301 TGTAGAACTC AAAATATTAG CCCAATAATA ATATTTAACA TCAACCAGCC
351 TCCTCTCATT TAATTCTCAC AACAQAATCT ATGAGTTGAG TGCAAAAATC
401 ATCCCTATTG TGCAQATGGG AAAACTGAGG GTCAGAAAAG TGAACTTCCC
451 AAGAACTGTC AAAGTTGGGA TTTGAACCCA GGTCTCTGAT GACTGGATGA
501 AGGAATGAAG ATACCTATAC TTGGGAATGA GGAGGGTCGA CAGGACACGA
551 GGGCTGACTT TGTATATTTC TAAACTTCAA AGATTTTCTG TATTTCAGCT
601 GGGAATATGG TAGAAGGTTA ATTGGAACAA AAAAATGCAA AGCAATGAAT
651 AAGACCTCAG TATTTGCTAT GCACAACAGG GTGACTGTAG TCCCACAAAT
701 AACTTCACTG TACATTGTTA AAATATAACT AAAGGTGTAT GCTTGGATTG
751 TTTGCAACAC AAAGGATATA TGCTTGAGGG GATGGATACC CCATTTACCC
801 TGATGATTAT TATGCATTAC ATGCTTGTAT CAAAACATCT CATATACCCC
851 ATAAATATAA AAACACCTAC TATGTACCCC AAAAAATTAA AAACAAATAA
1051 AGGCATGGTG GCACACACCT GTAGTCCCAG CCACTCAGGA AGCTGAGGTG
1101 GGAGGATCGC CTGAGCCTAG GAGGCTGTAC TCCAGCCTGG GTGACAGAGC
1151 GAGACTCTAT CTCAAAAAAT AAAATAAAAT AATAAAAAGT AGAAATCAAG
1201 AGGGAAAATG TGGGAGAAAT TGGGATAATT TTAACAATAC CTTCCACCAG
1251 AGTGATGATG AAGAATGCAT AAGTCACTTC TTAGTGGTCT TGATCTATAA
1301 AAAGTGTTCA ATAAATATCG ATTATTGTTA CTGTTATTGC TTCTAGACGT
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Table 3 (continued) 



Genomic Carboxy Terminal
(SEQ ID NO: 3)

1351 AATTCCTGGA AGCATTTTTT TTTTGAGATG GAGTCATGCT



1401 CTGTTGCTCA GGCTGGAGTG CAGTGGTATG ATCTCGGCTC ACTACAACTG 

1451 CCTCCTGGGT TCAAGCAATT CTCCTGCCTC AGCCCCCCAT GTAGCAGGGA 

1501 CTACAGGCAT GCGCCACCAC ACCCGGTGAA GTTTTGTATT TTTATTAGAG 

1551 ACAGGGTTTT GCCATGTTGG TCAGGCTGGT CTCGAACTCC TGACCTCAGG 

1601 CAATTTGCCT GCCTCGGCCT CCCAAAGTGC TGAGATTACA GGCTTGGGCC 

1651 ACTGCATCCA GCCX3AAGGCC TCCCATTTTG ATCAGAACCC TTCTCTAGAC 

1701 TGAGGGTGGG TGCCTCTAGA TCTTTTGCTC TTTAAAGACA GCAACCGATG 

1751 ACCCTGCTGA TGCTGAGTAC TGGCTGAATT CCTGTGGTCT CTGTCCCTAG 



1801 GATGGACTCC GTGTTGGTCA CTGTCAAGGC ATTGTTCTCC TCCAATTTGG 

1851 ACCCCAGCCT GGTGGAGCAA GTCTTTCTAQ ATAAGACCCT GAATGCCTCA 

1901 TTCCATTGGC TGGGCTCCAC CTACCAGTTG GTGQACATCC ATGTGACA GG 

1951 TACAAGGTGG GGTGGCTGGT TTCCTAACTG GAAGAGGTGG GGTTATGAGG 

2001 A7\AGATGGGG CTTCTCGGTA CCAGTGGAAT TGGTGGAGGC TCTAGAGAGG 

2051 GAAAGGGAGG CTTTCTGGAG ACCCATGTAG GTGACCTCTG GCAGTAGATC 

2101 ATCCAACGAG GCAGGAACAG AACACCAGCC ATTGCATCTA AGAGAATAGC 

2151 TATTTTTACA TGTAAAAAGA ATTGTGTTGA ATGAATGAAT CAATAGATCA 

2201 TTTATTTTGA ATCAATTTAT TGATTCATTC ATTTAATTAA TGAATAATAA 

2251 ATGATTCAGT ACATAATTGA TTAATTGATG TAATTGAGAA TTGATTTAAT 

2301 TGATTAATTG ATCAATTAAA ATGATCAATT AAATGAATGA ATCAGTAAAT 

2351 GAATAATTCA TTCATTCAAT AAACAATGGA AGTAGGCCGG GCATGGTGGC 

2401 TCACGCCTGT AATACCAGTA CTTTGGGAGG CCCAGGCAGG CAGATCACGA 

2451 GGTCAGGAGA TTGAGACCAT CCTGGCTAAC ACGGTGAAAC CCTGTCTCTA 
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Table 3 (continued) 



Genomic Carboxy Terminal 
(SEQ ID NO: 3) 



2501 CTAAAAATAC AAAAAAAATT AGCCAGGCAT GGTGGTGGCC ACCTGTAGTC
2551 GCAGCTACTC GGGAGGCTGA GGCAGGAGAA TGGCGTQAAC CCGGGAGGCA
2601 GAGCTTGCAG TGAGCCGAGA TCGCGCCACT GCACTCCAGC CTGGGCGACA
2651
2701 ATAAACAATG GAAGTAAACA CGTACTGATA ACACAGTGTG ATCATTGCTA
2751 TGATAAGGGA ATTTCAGGGG CCTGTGGGAG CCCCAAGGAG GAACACACAA
2801 CCTTGTCTTG GAAAGTTTTA TGTAGGAAGG GGTGAAQAAG CTGAGATCTG
2851 ACAGAGAATG GGACCTAGCC AGGGGTAATA GATGGAQAAT TGTGCTCCAT
2901 GCATCTATAA CCTAGAAGAT AGAAAGAATA TGGCATCTGG CCGGGTGCGG
2951 TGGCTCACGC CTGTAGTCCC AGCACTTTCA GAGGCTGAGA TGGGTGGATC
3001 ACCTGAGGTC AGGAGTTCAA GACCAGCCTG ACCAATATGA TGAAACCCCA
3051 TCTCTGCTAA AAATACAAAA ATTAGCCAGG CATGGTGGTG CGTGCCTGTA
3101 ATCCCAGCCA CTTGGGAGGC TGAGAGAGGA GAACTGCTTG AACTCGGGAG
3151 GCGGAGGTTG CAGTGAGCCG AGATTGTGCC ATTGCACTCA AGCCTGGGCA
3201 AAAAGAGCAA AACTGCATTT CAAAAAAAAA AAAAGTGGCA TTTTGGGGCA
3251 AGTTTAAGAA GATTGGTGTA GCTGGAGCAT CCACTTTGAT ACTGGAGAGG
3301 TGACAGTTGA AGCCAAAGAT GTGGGCAGAG ACTTTGTTGG GCACTGGAAT
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Table 3 (continued) 



Genomic Carboxy Terminal 
(SEQ ID NO: 3)



3351 GGCTTGGGGA GGAACATGAC ACACTCATQA GTTCTGCTTT AGAAAGZU^ 

3401 TGAAATGAAT TCTGCTCATC CTCTGGGTGC TGTGTGCAGA ATGGAGGGTG 

3451 GGGGGAGAGA AGAGCAAAGG CAAGAAGACC CTTTAGGAAC AATGATCATT 

3501 AGTTAGAAGA CTCTGGGTTT CTCAGCACCT GCAATTGCTG ACTACACCCC 

3551 CAGAGAAACC CAGTCTCrTT TCCCCCATGT TGTAGAGAAT TCTTACa^TG 

3601 CTTGGTAGAA AGAGAATTGA ACAGGTAGAT GGGTGGATGG ATACAAGCTG 

3651 GACAGATGGA TGGAGGAAGA TCCTCCATCC AATATAGAGC TGTTACCTAA 

3701 AACCCTCCAT CCCACCTTTA AAATCCTAGC TCAQCCAGGC GCGGTGGCTC 

3751 ACACCTGTAA TCCCAGCACT TTGGGAGGCC AAGGCGGGTG GATCACTTGA 

3801 GGTCGGGGGT TCGAGACCAG TCTGACCAAC ATGGTGAAAC CCCCTTCTCC 

3851 ACTAAAAATA CAAAAAAAAA AAAAAGTTAG CCAGGCAGGG TGGCX3CATGC 

3901 CTGTAATCCC GCTACTCGGG AGGCTGAGGC AGGAGAATGG CTTGCACCCA 

3951 GGAGGTGGAG GTTGTGGTGA GCCAAGATCA CGCCATTACA CTCCAGCCTG 

4001 GGCAAAGAGA GTGAAACTGT CTCAAAAAAC AAAACAAATG ACCCCCCTGC 

4051 CAAAAAAAAA AAAAAAAAAA AAQAAAAGAA AAAAAGAAAA GCCTAGCTCA 

4101 GCTCACACTG TCAGGAATAA GTAAGCTAGC TGGAATCATC TCTTTCTTAA 



Exon C3

4151 AACCCTGCCT TGATAGTGGA TTTTTACATA AATTCTAGAA

4201 ATGGAGTCAT CAGTTTATCA ACCAACAAGC AGCTCC AGCA CCCAGCACTT

4251 CTACCTGAAT TTCACCATC3V CCAACCTACC ATATTC CCAG GACAAAGCCC

4301 AGCCAGGCAC CACCAATTAC CAGAGGAACA AAAGGAATAT TGAGGATGCG

4351 GTGAGAAGGG GGTGGTATGT CCACTCTGTT GCCATGCAGA AACTGACTTA

4401 T6CATACTGG GTAGCCACAG GGTGACTTTT TATAACAATC CACAAAGACA




Table 3 (continued) 



Genomic Carboxy Terminal 
(SEQ ID NO: 3)

4451 GGTTCTTATT CCCATTTAAT ACACAAGCAC AGAGAGGTTC AGTAGCTGAC
4501 CCAAGGTCAC ACAGCTAAGT CATACCCTAG AAQAGCATGT CCTTTGATAT
4551 ACATACCTGG GCAAGTGGTT GTCATGACAA GAAGCAAAAT AGACGGAGAA
4601 GTGTGCTCAG TGGCTGAAAA TTCTCTGATG CTACTGGGGC CAGGATTCTG
Exon C4
4651 ACCTAAGAAA CATCGCCCTG TCTTTCA GCT CAACCAACTC TTCCGAAACA
4701 GCAGCATCAA GAGTTATTTT TCTGACTGTC AAGTTTCAAC ATTCAGGTAA
4751 GTTCTAACTC AGGACCTAAT GACTCTAGGA ACTTCTGCTG TCCTTTAAAT
4801 AGAAGTGTCC CCAAGCCATA GCTTTGATGG AAGAGAGCCC TAGAAATAfJA
4851 GAGCTGTTAA CTAAAAACTA GCTTTTTCCT AAAGCTGGAG CCCAACTGGC
4901 TTCAACACTC AAGAGAGCTG GTGTAAATCT CAGCAGACAT AAAGGTACCT
4951 GGTGCTGAGG CCATGGAGTC TAGAGTGTAG AATCTACTAC ATTAAGACAT
5001 CAGCTACTGA AATCAGGACC CATGGAAGAC GGGGGAAGGA GGGGACTAAA
5051 ACCAGATTAC TTAGAATCTA GCAGCCTAAC TGTGCTTTTC AATGAGAGGT
5101 ATCATTTCCA ATGGTGGGGG GTACCAATGA TTTTTTTTTT TTGACAACTG
5151 CCTTGAGAAC AGGCTTTCCT CACTAAACAA ATTCTGAATC AGAACAAATA
5201 AAGATAAGCC CTGAGAATAG GGCTTTTTCA AGGAGCTGGC AAACAGATCA
5251 AATAGTGACT ATGTTCTGCA QATTGATGTC TGGAGAACTC TACAGCTATT
5301 TTGACTGCTA GGCAGCTGGT TTTCACAGAT ATCATGATTC TGAGGCTGCC
5351 AGTTTTCAAA GTTACCGAGG ATCTTGCTGG ATGCAGTGGC TTGCGACTGT
5401 AATCCCAGCC CTTTGGGAGG CCAAGGTGGG TAGATCGCTT GAGCTCAGGA
5451 GTTTGAGACC AGCCTGGGCA ATATGGTGAA AACCCATCTC TACAAAAAAT
5501 ACAAAAATCA GCTGAGCATA GTGGCATGTG CTGTAGTCCC AGTTACTTAG
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Table 3 (continued)



Genomic Carboxy Terminal 
(SEQ ID NO: 3)

5551 GAGGCTGAGG TGGGAGGATG GCTTGAGCCC AGGAGGCAGA GGTTGCAGTG
5601 AGCTGACATT GTGCCATGCA CTCTAGCCTG GGCAACAGAG CCAAAGCCTG
5651 TCTCAAAAAA AAAAAAACAA ATAATAATAA TAATAAAATA CTGAGGATCT
5701 TGAAAGAGCA CTGTGGAAAT AATGCAAGTT AAAATGCCAC AAAGCTTGCT
5751 CTTTTTACTG AGATTTAACA CTTTCCTTAA CTAAACACCC CTCGAATTTT
5801 TGCAAGCCTT TGGTTCACTT CTAGACTTCT GGAAAAATTG ATTTGGACTA
5851 TTTTGGCCAA TGTTCTCATT GATTTTATGG GTATTCAGAA GTTGTTACCC
5901 CAACATTCCA GAAATGTTCT CCCTGTGGCT ATTACTTTAT TTATTTATTT
5951 ATTTATTTAT TTATTTATTT ATTTGAGACG GAGTCTCCCT CTGTTGCCCA
6001 GGCTGGAGTG CAGTGGCGCA ATCTCAGCTC ACTGCAACCT CCGCTTCCCA
6051 GGTTCAAGCG ATTCTCCTGC CTCAGCCTCC CAAGTAGCTG GGATTATGGA
6101 TGTGCACCAC CACACCGGCT AATTTTTGTG TTTTTAGTAG AGATGGGGTT
6151 TCACTGTGTT GGCCAGGCTG GTCTCGAACT CCTGATCTCA AGTGATCCAC
6201 CCGCCTTGGC CTCCCAAAGT GCTGGGATAA -^AGGCATGAG CCACT6TGCC
6251 TGACCTCCCT GTGGCTATTT TTAAATGAAT TAAGTGGAAT AAAATTAGAA
6301 ATTCAGTTCT TCTCCCACGC TAGCTGCATT TTAAGCATTT AATAACAACA
6351 TGAAGCTACT AATGGCTGCA TTGTGTAGTG CAGATGTAGA ATTTTTTTTT
6401 TGTTTTTTGT TTTGTTTTTG AGATGGAGTC TCGCTCTGTC ACCAGGCTAG
6451 AGTGCAGTGG CGTGATCTCG TCTCACTGCA ATCTCTACTC CCCGATTCAA
6501 GTGATTCTCC TGCCTCAGCC TCCCAAGTAG CTGGGATTAC AGGCACGTGC
6551 CACCACACCC AGCTAATATT TGTATGGATG GTCTCAATCT CCTGACCTCG
6601 TGATTTGTAT GGATGGTCTC GATCTGACCT CATGATCCGC CTGCCTGGGC
6651 CTCCCAAAGT GCTGGGATTA CAGGCGTGAG CCACTGTGCC CGGCCGACAT
6701 AGAATGTTTA CATCATTGCA GAAAGTTTCT GCAGGAAGAG CCTAGAAGGA
Table 3 (continued)

Genomic Carboxy Terminal
(SEQ ID NO: 3)



6751 GAAAGCCTAG AATCATGATA AAATTGCAGA TATCTTTGCT TATCCCTGTC
Exon C5
6801 CCCTTCCAGG TCTGTCCCCA ACAGGCACCA CACCGGGGTG GACTCCCTGT
6851 GTAACTTCTC GCCACTGGCT CGGAGAGTAG ACAGAGTTGC CATCTATGAG
6901 GAATTTCTGC GGATGACCCG GAATGGTACC CAGCTGCAGA ACTTCACCCT
6951 GGACAGGAGC AGTGTCCTTG TGGATGGTA A AGCTCCCTGG GTCATTGGGA
7001 CTGAGGTGGA AGCTCCCACT TCCTCACCTG GGTCCTTCCC TGGGAATCTG
7051 AAGGCTTGGG GTTGATTCGT CATCGAGCTT TCTCAGACTG GGAGAAAGTG
7101 GCTTAGTTCT CCTAAGCTTT ACCCATCATT GAAGGAAAGA AAAGGACGCC
7151 CGAGGGATAT GGGAGGCATT TGCCCTCTTC TGGCCAGCTC TGTGACCTCA
7201 GGCTAGTCAC ATCTCCTTTC TGGACTTCTT ATCTCTCTGT ACTTAGCAAG
7251 CCACTTGGTT TTTGGTTCCC ATCTTGCCTG CCCTAGATGG TATTGCTCCT
7301 CCACCCCCAG GCAGCTGCAG TGTTAAACAA TTACCCTGAT TAGTTATTGT
7351 TGTTGTGTTG TTTGTTTGTT TTTGAGACAG GGTCTCACTC TGTCACCTAG
7401 GCTGGAGTGC AGTGACATGA TCTCAGCTCA CTGCAACCTC AACCCCTGGA
7451 CTCAAGCAAT CCACCCACTT CAGCCTCCCA AGTAACTGGG ACTACAGCCA
7501 TGCGCCACCA CACCCGGATA ATTTTTGTAT TTTTTCTAGA GATGGGGTTT
7551 TGCAACATTG CCCAGGCTGG TCTTGAACTC CTGAGCTCAA GCATGCCACC
7601 TGCTTCAGCC TCCCAAAGTG CTGGGATTAC AGGCAGGCAG GCACCACTGC
7651 AGCTGGTTCT GGTTTTTTGT GTTTGTTTTT TTCTTTTAGA GGCAGGGTCT
7701 CGCTCTGTTA ACCAGAATGG AGTACAGTGG TGCAATCATA GCTCACTGCA
7751 GTCTTGAACT CCTGGGCTCA AGCGATCCTC CCACCTCAGC CTCCTGAGTA
7801 CCTGGAACTA CAGGCACGTG TCACCACGCC TTGCTAATTT CTAAATTTTT
7851 TGTAGAGACA GGGTCTJCACT ATGTTGCCCA GACTGGTCTC TAATTCCTGG
Table 3 (continued)

Genomic Carboxy Terminal
(SEQ ID NO: 3)



7901 CCACAAGTGA TCCTCCTGCC TCAGCACSGTC AATGAGGGCT TCCAGTTTCA 
7951 AGTTGTATGT GATTCATCCT CAACAAATGT GGTAGGATG6 ACCTATTTTC 
8001 CAACTCCAGA GATGGCTTCA AGGTGGCTCA ACTTTGCATA TCCAATTTTA 
8051 CCCATTCAAA GAATAGTTAT ATACATTGTA CCATGTATCA GGAATATAAC 
8101 AGAGAGTAAC TGTTTGCTCT TTCACCACTA TATTCCAAGA ACCCCATATT 
8151 CTGCCTGGCA CATAATAAAC ACTCAAGTCA TATTTGCAGA AGGAATAACT 
8201 AGATTTCATA CAAGGTTCTT TTCAAGTCAA ATGCGAATAA CGTTTTAGAC 
8251 GGGACCTTCC AATGCCTGTG TGCACTGTCC TTGATTCCGA ATTATTQTTG 
8301 TGCAAGAGAG CACTGTTQAT CCTTCAGAAT CAACAAGCCT TTCACATGCC 
8351 TGTCACAGGT TTTTCTTTTT CTTGTTTTAC CAATTTTGTT TGTTGTTTGT 
8401 TTGTTGTTAT TGTTTTGTTT TGTTTTT6TT TTTTATTTGT TTTTATTTTT 
8451 TCTTTTTTTT TGAGACAGAG TCTCGCTCTG TCACCCAGGC TGGAGTGCAG 
8501 TGGCACGATC TCCGCCCACT GCAAGCTCCG CCTCCTGGGT TCATGCCATT 
8551 TTCCTGCCTC AGCCTCCTGA GTAGCTGGGA CTACAGGCGC CTGCCACCAT 
8601 GTCTGGCTAA TTTTTTTTGT ATTTTTAGTA GAAACAGGGT TTCACCATGT 
8651 TGACCAGGAT GGTCTCGATC TCCTGACCTC GTGATCTGCC CACCTGGGCC 
8701 TCCCAAAGTG CTGGGATTAC AGGCGTGAGC CACCACACCC AGCCCCAATT 
8751 TTTTTTTTAA TTAAAATTGT TGTCAGCTCA CAAGCTTTCT AAAAACAGGC 
8801 CATGGACCCA GCATCGCTGT AGTTTGCCAA ACCCTTGCCT TGAATCAGTA 
8851 CCATCCAATA GAACTTTCTG CAGTQATAGA AAATGTTTCT ATCTGTGCTA 
8901 TTCAGCACAA AGCCATGTGT GATTACTAAG CTTGAAGTGT GGTTAATGTA 
8951 ACTGAGATAC CGAAGTTTTA ATTTTATTTA ATTTTAATTT AAAAAGCCAC 
9001 TTGTGGCTGC TCCATATTGC ACACTACTTT TTAAAATTAT TATTTGTATA 
9051 TATTTAAGGG GCACAAGTAC AATTTTGTTG CATGGATTTA TAGCCCAGTG 
Table 3 (continued)



Genomic Carboxy Terminal 
(SEQ ID NO: 3) 



9101 GGGAAGTCTG GGCTTTTAGG GTATCTATTA CCTGAATAAT GTACATTGTA
9151 CCCATTGAGT AATTTCTCAT CATCCACTCT CCTCCACTCC CCAACCCTTC
9201 CAAGTTTCCA CTGTCTATTA TTCCACTCTC TATGTCCATG CCTATGCATT
9251 ATTTAGCATT GACATGTCTA TGCATTATTT AGTCAAATAC ATGTGCTATT
9301 TGACTTCCTG TATCTGAGTT GTTTGACTTA AGATAATGAC CTTCACTTGC
9351 ATCCATGTTG CTGCAAAAGA CATGATTTCA TTCTTTTTTA TGCCTGGGTG
9401 GTATTGCATT GTGTGTGTGT GTGTGTGTGT GTGTGTAGAG AGAGAGAGAG
9451 ATCACATTTT CTTTATACAG TCCTCCATTG ATGGGCACTT AGGTTGATTC
9501 CATATCTTTG CTATTGTGAA TAGTTTTGTG ATAAACACAC AGGTTCAGGT
9551 GTCTTTTTGA CAAAATTATT TATTTTCCTT TGTGTAGATA CCCAGTCGTG
9601 GGATTCCTGG ATCAAATGGT AGTTTCATTT TTAGTTATTT GAGAAATCTC
9651 CACGTTTTTC ATAGAGATTA TACTAAATTA CATTCCCACC AACAGTGTGT
9701 AACGGTTCAC TTTTCTTGCA TCCTTTTTAA CATCTGTTAT TTTTGTCTTT
9751 TTAGTAACAG CCATTCTGAC TGGCGTAAGG TGGTATCTCA TCATGGTTTT
9801 AATCTGTATT TCTCTGATTA TTAGTAATGT CGAGCATTTT TTCATATGCT
9851 TGTTAGCCAT TGGTATGTCT TCTACATCTT TAAGAAGCTG GCTATGGGCT
9901 GGGCGCAGTG GCTCACACCT GTAATCCCAG CACTTTGGGA GGCCGAGGCA
9951 GGCGGATCAC GAGGTCAGGA GTTAAAAACC AGCCTGGCCA ACATGGTAAA
10001 ACCCTGCCTC TACTAAAAAT ACAAAAAATT ACCCAGGCAT GGTGGTGCGC
10051 CTGTAATCCC AGCTACTCAG GAAGCTGAGG CAGGAGAATC ACTTGAACCC
10101 AGGAGGCGGA GGTTGCAGTG AGACGAGATC ACATCATTGC ACTCCAGCCT
10151 GGGTGACAGA GTGAGACTCT ATCTTGAGAA AAAAAAAAAG TTGGCTATAA
10201 CAGGGTTGTA GAAGTAGAGG AACCAGTAAC CCTTCTCGCC ATGCCTGATG
10251 ATGGCTTTAC ATCCCTGTCT TCATGGAGTT TATGCTGTCG TGAGGAATAA
Table 3 (continued)

Genomic Carboxy Terminal
(SEQ ID NO: 3)



10301 CAAGAACAGG CAGTTGTCAA TTATAAATTA TTTGATGTGA ACCTATTCAT
10351 ACATGGGTGT GGTCATCAGG GAAGGCTTCC TGGAGGAAAT GACATTGAAG
10401 GTGAATTCTA AAAGATGACG ATAAACCACC AAGTGAAGGA GAGCTTASlAT
10451 GTGTTTTTAG GCAGAAGAAA AACCTTTTGG GTGAAAATTT TAAAACTTAG
10501 AGAGGTCCCA TCAGTTTCCA ACTGCGATGA TCCATTCTCT CCACCACTGC
10551 CCTTGGGCCC AGCCCAATTT AGGTCCACCA TGCCCAGAGG CATGAATTTA
10601 ACTTATGACA CTCTTGTGGT GGAATAATGG CTTTGGGCTT ATGTAGCCAT
10651 GTGTCATTTT TTTAGAGATA CAAATTGAAA TATTTGGGGT GAGATGTCAT
10701 GGTGTCTACT GGCCTCTAAA ACTTCAGTGA AAACATTTAC TTTCACTGAA
10751 ATGTCAATAA ATCATAAATT GGATGTATAT GTTTTAGTTG GAGGAAATAT
10801 AAACCACTAA ATCTAGGTGA TGCATATTTA TTATACTCTT CTCTCTGCTT
10851 TTTTGTACGC TTGTAAAATT GTATTTAAAA GAATAAGACA CACTTGGCCG
10901 GGCGCGGTGG CTCACGCCTG TAATCCCAGC ACTTTGGGAG ACCGAGGTGG
10951 GTGGATCATG AGGTCAGGAG TTCAAGACCA GCCTGGCCAA CATGGTAAAA
11001 CTCCATCACT ACATACAAAA ATTAGCCAGG CATTTTGGCG GGCACCTGTA
11051 ATCTCAGCTA CTTGGGAGGC TGAAGCAGGA GAATTGCTTG AACCCGGGAA
11101 GCAGAGGTTG CAGTGAGCCA AGATCACGCC ACTGCACTCT AGCCTGGGCA
11151 ACAGAGCAAG ACTCCATCTC CAGAAA2^AAA AAAAAAAAAA GACACACTCA



11201 CATGCACCCT CCATTTCTTT CATTTCTAGG GTATTCTCCC AACAGAAATG 

11251 AGCCCTTAAC TGGGAATTCT GGTAAGTCTC AAAGAAGCCC CAGCCCAGGG 

11301 TAGGGAGGGG GTAGCCTGAT GGTGCTTTGC CTTGTCCAAG AGCACCAGGC 

11351 ACACAGAGTC TTGGATGAGG ATCAAAATTG CCAACCCATG GCAAAGACTA 

11401 TTGAGGCATA GTAAAGGGAT AGCAGGGATC CTGGCTTTCT GGGGGCCCAG 



Exon C6 
Table 3 (continued)

Genomic Carboxy Terminal
(SEQ ID NO: 3)



11451 TTTTTGGGGG CATCAGAGGC ATGAGGTGTT GAGCCACTAA GCTCTCTTCC
11501 CCAGGGGCTG TGCCCATCCT CAGGCCACAT AGGGTCCAAG AAGGAGCCCT
Exon C7
11551 GGGACGTGGC AGGAGGTGGC TCACCCCAGC CCTTGTCTCC CC AGACCTTC
11601 CCTTCTGGGC TGTCATCCTC ATCGGCTTGG CAGGACTCCT GGGAGTCATC
11651 ACATGCCTGA TCTGCGGTQT CCTGGTG AGC AAGGAAGGGT TGCTTGTCTT
11701 CTTAACAATT GGGTTGTAAG AGTTCTTAAT ATATTATAAA ACCATACTAT
11751 ACTATACACA AGTCCTTTGC TGGATATATG TTTTGCAAAT ATTTTCTCCC
11801 AGTTCACGGA GTGGCTTTCC TATTTTCTTT TTATAATTTT ATTTTTAATT
11851 AATTGACAAA TAATGAATGC ATATATTTAG GGGATACAAT GTGATGCTTT
11901 GGTATATGTA CAATTATGGA ATGACTCAAT CAAGCTAATT AATATGTCCC
11951 TCACCTCTCA TACTTATTAT TTCTTTGTGG TGT6AACATT GGCAACCTAT
12001 ACTCTTAGCA ATTTTGAAAT CTACATTATT ATTAACTATA GTTACTATGT
12051 TATGCAGATC TCAAAAACTT CACAACCTAT ATGCTGATTA CAAGATATTG
12101 AGAGAAAAAG TGATTGCAAA GAGTGTAAAT AAAATAATGT AAGAGGGAAA
12151 AATCTAACAA AATTAGTCGT TAQGGAAATG TACACGGAAG TCACAATGAG
12201 AGGQCACTTT TCACAAGAAT GGATAA^^TT ACTATAACAA
12251 GTGTTGGTGA AAATGTGACA 6AACTGGAAC TCTCATAAAG TGAAAGTGGA
12301 AAATAGCTTG GCCATTTCTT TGAAJ\ATTAC ACACACCTAC CGTAAGACCT
12351 ACCATCCCAC TACTAGTAAT TTATCTAAGA GAAATAAAAA CATATGTCTA
12401 TATGAAGACT TGTACACAAG TAAATGTTCA TAACAGCTTT 6TTTGTAATA
Table 3 (continued)

Genomic Carboxy Terminal
(SEQ ID NO: 3)



12451 GCCAAACTCT GAAAACAAGC CCCTAATGTC CATTAACAAA TATATCCTGA 

12501 CAATGGAATA TTATTCAGC3V ACAAAAAGGA ATTATTAATA CATTAATAAA 

12551 TTATACAGCA ACATGTATAA ATTGCAAAAT AGTTATGCCT AGTGAAAGAA 

12601 TCCAGATGAA GAAAAGAGTA CATGCCATAT GATTCCCTTA ATAGACAAAT 

12651 TCTAGAAAAT ACAAACTAAT CTGTAAGGAC AGGAATCAGA TCAGCGGTTG 

12701 CCTGGGAATG AAAATGTGTT TGCAGTGGCA GGGAAAAAGG AATTGTAAAA 

12751 GAGCAGGAAG AAAGTTTTTT TGTTGTTTTT TTTTTGTTTT TTCTTGAGAC 

12801 AGAGTCTTAG TCTATCGCCC AAGCTGGAGT GCAATGGCAC GATCTCAGCT 

12851 CATTGCAACC TCTGCCTCTC GGGTTCAAGC GTTTTTCCTG CCCCAGCCTC 

12901 CCAAGTAGCT GGGATTACAC ATGCGCACCA CCACACTCAG CTAATTTTTG 

12951 TATTTTTAGT AGAGACX3GGG TTTTACCATG TTGGCCAGGC TGGTCTCGAA 

13001 CTCCTGACCT CAGGTGATCC ACCCGCCTTG GCCTCCCAAA GTGCTGGGAT 

13051 TACAGGftGTG AGCCACCATG CCTGGCCAGG ACGAAAGTTT TGGGGATGAT 

13101 GGATGGATGT TCCTTATGTT GATTGTGGTG ACGATTCAAT AAGTTATGAT 

13151 CAGAACTTAT CAAAACATTC ACTTTAAATG TGTGCAGTTT ATTTTATGTC 

13201 AGTTATGCCT CAGTTAAGCT GGACAGATGT AGAGGAGGAA GGGAGGGAGA 

13251 GAGGGGGCTG AGATCASGAC CAAAAGCCAG AGAGAAAGAG ACTGAGAATG

13301 AGATGAQAGA GAAATGGTAT TTAGACAGAA GACAGGCGAT AGATGATTQA 

13351 TAGTTGACAG ATGATTGGTG GATANNNNNN NNNNNNNNNN NNNNNNNNNN 

13401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
Table 3 (continued)

Genomic Carboxy Terminal
(SEQ ID NO: 3)



13451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
13501 AGGAGGTTTA AACAAAACGC AATTATGTTG AAATGACAAT GATTGTGGAT
13551 ATAAAGGTAG ATAGAAATAG ATATTTGTGA AGATAATGGT TAGATAAAAA
13601 TGATAGGTAA CAGATATTGA TAGATCTTGA TAAGTAGATG ATAAATACAT
13651 GATTGATGGA TGACAGGTGA TTGATAGATG ATTTGATGGA TTATAAATAG
13701 GAGATGATTG AGAGGTGAGA GATAATTGAT GGTTATTTGA TTGGTAGATA
13751 ATTGATTGAC AGGTTGATAA ATATTGATAG CTAGATGATA GATAAATAGA
13801 TCATTGGTAG ATATGTGATA TATTGATAAA GAAATTCAGA GGCAAAAGGA
13851 GAGAGAAATG AAGGGGATAT CGGAGGGGGA AAAATTTTTT TAAACCGAGA
13901 GTGAAACAAG GAGACAGAAG AAAAGAAAGT GGTGAT^GA GGAAAAGAAC
13951 TGAGGGAGAA ATTAAATGAA ACAATGAAGG GAGACAGAGG AAGCATAAGG
Exon C8
14001 CCTCTGGCTT TGGCCATATT CTCACCCCTG TGGTCTCCTC TCCCTGGACG
14051 GCTGACCAGT CCATTCTCAC GCCTCCTCCT CACCCTCATA GGT GACCACC
14101 CGCCGGCGGA AGAAGGAAGG AGAATACAAC GTCCAGCAAC AGTGCCCAGG
14151 CTACTACCAG TCACACCTAG ACCTGGAGGA TCTGCAAT GA CTGGAACTTG
14201 CCGGTGCCTG GGGTGCCTTT CCCCCAGCCA GGGTCCAAAG ATVGCTTGGCT
14251 GGGGCAGAAA TAAACCATAT TGGTCGG
Table 4

Human cDNA of CA125
(SEQ ID NO: 4)



1 AAGCGTTGCA CAATTCCCCC AACCTCCATA CATACGGCAG CTCTTCTAGA 

51 CACAGGTTTT CCCAGGTCAA ATGCGGGGAC CCCAGCCATA TCTCCCACCC 

101 TGAGAAATTT TGGAGTTTCA GGGAGCTCAG AAGCTCTGCA GAGGCCACCC 

151 TCTCTGAGGG GATTCTTCTT AGACCTCCAT CCAGAGGCAA ATGTTGACCT 

201 GTCCATGCTG AAACCCTCAG GCCTTCCTGG GTCATCTTCT CCCACCCGCT 

251 CCTTGATGAC AGGGAGCAGG AGCACTAAAG CCACACCAGA AATGGATTCA 

301 GGACTGACAG GAGCCACCTT GTCACCTAAG ACATCTACAG 6TGCAATCGT 

351 GGTGACAGAA CATACTCTGC CCTTTACTTC CCCAGATAAG ACCTTGGCCA 

401 GTCCTACATC TTCGGTTGTG GGAAGAACCA CCCA6TCTTT GGGGGTGATG 

451 TCCTCTGCTC TCCCTGAGTC AACCTCTAGA GGAATGACAC ACTCCGAGCA 

501 AAGAACCAGC CCATCGCTGA GTCCCCAGGT CAATGGAACT CCCTCTAGGA 

551 ACTACCCTGC TACAAGCATG GTTTCAGGAT TGAGTTCCCC AAGGACCAGG 

601 ACCAGTTCCA CAGAAGGAAA TTTTACCAAA GAAGCATCTA CATACACACT 

651 CACTGTAGAG ACCACAAGTG GCCCAGTCAC TGAGAAGTAC ACAGTCCCCA 

701 CTGAGACCTC AACAACTGAA GGTGACAGCA Ci .jAGACCOC CTGGGACACA 

751 AGATATATTC CTGTAAAAAT CACATCTCCA ATGAAAACAT TTGCAGATTC 

801 AACTGCATCC AAGGAAAATG CCCCAGTGTC TATGACTCCA GCTGAGACCA 

851 CAGTTACTGA CTCACATACT CCAGGAAGQA CAAACCCATC ATTTGGGACA 

901 CTTTATTCTT CCTTCCTTGA CCTATCACCT AAAGGGACCC CAAATTCCAG 

951 AGGTGAAACA AGCCTGGAAC TGATTCTATC AACCACTGGA TATCCCTTCT 

1001 CCTCTCCTGA ACCTGGCTCT GCAGGACACA GCAGAATAAG TACCAfiTGCG 

1051 CCTTTGTCAT CATCTGCTTC AGTTCTCGAT AATAAAATAT CAGAGACCAG 

1101 CATATTCTCA GGCCAGAGTC TCACCTCCCC TCTGTCTCCT GGGGTGCCCG 

1151 AGGCCAGAGC CAGCACAATG CCCAACTCAG CTATCCCTTT TTCCATGACA 

1201 CTAAGCAATG CAGAAACAAG TGCCGAAAGG GTCAGAA6CA CAATTTCCTC 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



1251 TCTGGGGACT CCATCAATAT CCACAAAGCA GACAGCAGAG ACTATCCTTA
1301 CCTTCCATGC CTTCGCTGAG ACCATGGATA TACCCAGCAC CCACATAGCC
1351 AAGACTTTGG CTTCAGAATG GTTGGGAAGT CCAGGTACCC TTGGTGGCAC
1401 CAGCACTTCA GCGCTGACAA CCACATCTCC ATCTACCACT TTAGTCTCAG
1451 AGGAGACCAA CACCCATCAC TCCACGAGTG GAAAGGAAAC AGAAGGAACT
1501 TTGAATACAT CTATGACTCC ACTTGAGACC TCTGCTCCTG GAGAAGAGTC
1551 CGAAATGACT GCCACCTTGG TCCCCACTCT AGGTTTTACA ACTCTTGACA
1601 GCAAGATCAG AAGTCCATCT CAGGTCTCTT CATCCCACCC AACAAGAGAG
1651 CTCAGAACCA CAGGCAGCAC CTCTGGGAGG CAGAGTTCCA GCACAGCTGC
1701 CCACGGGAGC TCTGACATCC TGAGGGCAAC CACTTCCAGC ACCTCAAAAG
1751 CATCATCATG GACCAGTGAA AGCACAGCTC AGCAATTTAG TGAACCCCAG
1801 CACACACAGT GGGTGGAGAC AAGTCCTAGC ATGAAAACAG AGAGACCCCC
1851 AGCATCAACC AGTGTGGCAG CCCCTATCAC CACTTCTGTT CCCTCAGTGG
1901 TCTCTGGCTT CACCACCCTG AAGACCAGCT CCACAAAAGG GATTTGGCTT
1951 GAAGAAACAT CTGCAGACAC ACTCATCGGA GAATCCACAG CTGGCCCAAC
2001 CACCCATCAG TTTGCTGTTC CCACTGGGAT TTCAATGACA GGAGGCAGCA
2051 GCACCAGGGG AAGCCAGGGC ACAACCCACC TACTCACCAG AGCCACAGCA
2101 TCATCTGAGA CATCCGCAGA TTTGACTCTG GCCACGAACG GTGTCCCAGT
2151 CTCCGTGTCT CCAGCAGTGA GCAAGACGGC TGCTGGCTCA AGTCCTCCAG
2201 GAGGGACAAA GCCATCATAT ACAATGGTTT CTTCTGTCAT CCCTGAGACA
2251 TCATCTCTAC AGTCCTCAGC TTTCAGGGAA GGAACCAGCC TGGGACTGAC
2301 TCCATTAAAC ACTAGACATC CCTTCTCTTC CCCTGAACCA GACTCTGCAG
2351 GACACACCAA GATAAGCACC AGCATTCCTC TGTTGTCATC TGCTTCAGTT
2401 CTTGAGGATA AAGTGTCAGC GACCAGCACA TTCTCACACC ACAAAGCCAC
2451 CTCATCTATT ACCACAGGGA CTCCTGAAAT CTCAACAAAG ACAAA6CCCA
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



2501 GCTCAGCCGT TCTTTCCTCC ATGACCCTAA GCAATGCAGC AACAAGTCCT 

2551 GAAA6AGTCA GAAATGCAAC TTCCCCTCTG ACTCATCCAT CTCCATCAGG 

2601 GGAAGAGACA GCAGGGAGTG TCCTCACTCT CAGCACCTCT GCTGAGACTA 

2651 CAGACTCACC TAACATCCAC CCAACTGGGA CACTGACTTC AGAATCGTCA 

2701 GAGAGTCCTA GCACTCTCAG CCTCCCAAGT GTCTCTGGAG TCAAAACCAC 

2751 ATTTTCTTCA TCTACTCCTT CCACTCATCT ATTTACTAGT GGAGAAGATU^ 

2801 CAGAGGAAAC TTCGAATCCA TCTGTGTCTC AACCTGAGAC TTCTGTTTCC 

2851 AGAGTAAGGA CCACCTTGGC CAGCACCTCT GTCCCTACCC CAGTATTCCC 

2901 CACCATGGAC ACCTGGCCTA CACGTTCAGC TCAGTTCTCT TCATCCCACC 

2951 TAGTGA6TGA GCTCAGAGCT ACGAGCAGTA CCTCAGTTAC AAACTCAACT 

3001 GGTTCAGCTC TTCCTAAAAT ATCTCACCTC ACTGGGACGG CAACAATGTC 

3051 ACAGACCAAT AGAGACACGT TTAATGACTC TGCTGCACCC CAAAGCACAA 

3101 CTTGGCCAGA GACTAGTCCC AGATTCAAGA CAGGGTTACC TTCAGCAACA 

3151 ACCACTGTTT CAACCTCTGC CACTTCTCTC TCTGCTACTG TAATGGTCTC 

3201 TAAATTCACT TCTCCAGCAA CTAGTTCCAT GGAAGCAACT TCTATCAGGG 

3251 AACCATCAAC AACCATCCTC ACAACAGAGA CCACGAATGG CCCAGGCTCT 

3301 ATGGCTGTGG CTTCTACCAA CATCCCAATT GGAAAGGGCT ACATTACTGA 

3351 AGGAAGATTG GACACAAGCC ATCTGCCCAT TGGAACCACA GCTTCCTCTG 

3401 AGACATCTAt" GGAT^ ATGGCCAAAG AAAGTGTCTC AATGTCAGTA 

3451 TCTCCATCTC AGTCCATGGA TGCTGCTGGC TCAAGCACTC CAGGAAGGAC 

3501 AAGCCAATTC GTTGACACAT TTTCTGATGA TGTCTATCAT TTAACATCCA 

3551 GAGAAATTAC AATACCTAGA GATGGAACAA GCTCAGCTCT GACTCCACAA 

3601 ATGACTGCAA CTCACCCTCC ATCTCCTGAT CCTGGCTCTG CTAGAAGCAC 

3651 CTGGCTTGGC ATCTTGTCCT CATCTCCTTC TTCTCCTACT CCCAAAGTCA 

3701 CAATGAGCTC CACATTTTCA ACTCAGAGAG TCACCACAAG CATGATAATG 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



3751 GACACACTTG AAACTAGTCG GTGGAACATG CCCAACTTAC CTTCCACX3AC 

3801 TTCCCTGACA CCAAGTAATA TTCCAACAAG TGGTGCCATA GGAAAAAGCA 

3851 CCCTGGTTCC CTTGGACACT CCATCTCCAG CCACATCATT GGAGGCATCA 

3901 GAAGGGGGAC TTCCAACCCT CAGCACCTAC CCTGAATCAA CAAACACACC 

3951 CAGCATCCAC CTCGGAGCAC ACGCTAGTTC AGAAAGTCCA AGCACCATCA 

4001 AACTTACCAT GGCTTCA6TA GTAAAACCTG GCTCTTACAC ACCTCTCACC 

4051 TTCCCCTCAA TAGAGACCCA CATTCATGTA TCAACAGCCA GAATGGCTTA 

4101 CTCTTCTGGG TCTTCACCTG AGATGACAGC TCCTGGAGAG ACTAACACTG 

4151 GTAGTACCTG GGACCCCACC ACGTACATCA CCACTACGGA TCCTAAGGAT 

4201 ACAAGTTCAG CTCAGGTCTC TACACCCCAC TCA6TQAGGA CACTCAGAAC 

4251 CACAGAAAAC CATCCAAAGA CAGAGTCCGC CACCCCAGCT GCTTACTCTG 

4301 GAAGTCCTAA AATCTCAAQT TCACCCAATC TCACCAGTCC GGCCACAAAA 

4351 GCATGGACCA TCACAGACAC AACTGAACAC TCCACTCAAT TACATTACAC 

4401 AAAATTGGCA GAAAAATCAT CTGGATTTGA GACACAGTCA GCTCCAGGAC 

4451 CTGTCTCTGT AGTAATCCCT ACCTCCCCTA CCATTGGAAG CAGCACATTG 

4501 GAACTAACTT CT6ATGTCCC AGGGGAACCC CTGGTCCTTG CTCCCAGT6A 

4551 GCA6ACCACA ATCACTCTCC CCATGGCAAC ATGGCTGAGT ACCAGTTTGA 

4601 CAGAGGAAAT GGCTTCAACA GACCTTGATA TTTCAAGTCC AAGTTCACCC 

4651 ATCAGTACAT TTGCTATTTT TCCACCTATG TCCACACCTT CTCATGAACT 

4701 TTCAAAGTCA GAGGCAGATA CCA6TGCCAT TAGAAATACA GATTCAACAA 

4751 CGTTGQATCA GCACCTAGGA ATCAGGAGTT TGGGCAGAAC TGGGGACTTA 

4801 ACAACTGTTC CTATCACCCC ACTGACAACC ACX3TGGACCA GTGTGATTGA 

4851 ACACTCAACA CAAGCACAGG ACACCCTTTC TGCAACXSATG AGTCOTACTC 

4901 ACGTGACACA GTCACTCAAA GATCAAACAT CTATACCAGC CTCAGCATCC 

4951 CCTTCCCATC TTACTGAAGT CTACCCTGAG CTCGGGACAC AAGGGAGAAG 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



5001 CTCCTCTGAG GCAACCACTT TTTGGAAACC ATCTACAGAC ACACTGTCCA 
5051 GAGAGATTGA GACTGGCCCA ACAAACATTC AATCCACTCC ACCCATGGAC 
5101 AACACAACAA CAGGGAGCAG TAGTAGTGGA 6TCACCCTGG GCATAGCCCA 
5151 CCTTCCCATA GGAACATCCT CCCCAGCTGA GACATCCACA AACATGGCAC 
5201 TGGAAAGAAG AAGTTCTACA GCCACTGTCT CTATGGCTG6 GACAATGGGA 
5251 CTCCTTGTTA CTAGTGCTCC AGGAAGAAGC ATCAGCCAGT CATTAGGAAG 
5301 AGTTTCCTCT GTCCTTTCTG AGTCAACTAC TGAAGGAGTC ACAGATTCTA 
5351 GTAAGGGAA6 CAGCCCAAGG CTGAACACAC AGGGAAATAC AGCTCTCTCC 
5401 TCCTCrCTTG AACCCAGCTA TGCTGAAGGA AGCCAGATGA GCACAAGCAT 
5451 CCCTCTAACC TC^TCTCCTA CAACTCCTGA TGTGGAATTC ATAGGGGGCA 
5501 GCACATTTTG GACCAAGGAG GTCACCACAG TTATGACCTC AGACATCTCC 
5551 AAGTCTTCAG CAAGGACAGA GTCCAGCTCA GCTACCCTTA TGTCCACAGC 
5601 TTTGGGAAGC ACTGAAAATA CAGGAAAAGA AAAACTGAGA ACTGCCTCTA 
5651 TGGATCTTCC ATCTCCAACT CCATCAATGG AGGTGACACC ATGGATTTCT 
5701 CTCACTCTCA GTAATGCCCC CAATACCACA 6ATTCACTTG ACCTCAGCCS^ 
5751 TGGGGTGCAC ACCAGCTCTG CAGGGACTTT GGCCACTGAC AGGTCATTGA 
5801 ATACTGGTGT CACTAGAGCC TCCAGATTGG AAAACGGCTC TGATACCTCT 
5851 TCTAAGTCCC TGTCTATGGG AAACAGCACT CACACTTCCA TGACTGACAC 
5901 AGAGAAGAGT GAAGTSTCTT CTTCAATCCA TdCCdGACCT GAGACCTCAG 
5951 CTCCTGGAGC A6AGACCACT TTGACTTCCA CTCCTGGAAA CAGGGCCATA 
6001 AGCTTAACAT TGCCTTTTTC ATCCATTCCA 6TGGAAGAA6 TCATTTCTAC 
6051 AGGCATAACC TCAGGACCAG ACATCAACTC AGCACCCATG ACACATTCTC 
6101 CCATCACCCC ACCAACAATT GTATGGACCA GTACAGGCAC AATTGAACAG 
6151 TCCACTCAAC CACTACATGC AGTTTCTTCA GAAAAAGTTT CTGTGGAGAC 
6201 ACAGTCAACT CCATATGTCA ACTCTGTGGC AGTGTCTGCT TCCCCTACCC 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



6251 ATGAGAATTC AGTCTCTTCT GGAAGCAGCA CATCCTCTCC ATATTCCTCA
6301 GCCTCACTTG AATCCTTGGA TTCCACAATC AGTAGGAGGA ATGCAATCAC
6351 TTCCTGGCTA TGGGACCTCA CTACATCTCT CCCCACTACA ACTTGGCCAA
6401 GTACTAGTTT ATCTGAGGCA CTGTCCTCAG GCCATTCTGG GGTTTCAAAC
6451 CCAAGTTCAA CTACGACTGA ATTTCCACTC TTTTCAGCTG CATCCACATC
6501 TGCTGCTAAG CAAAGAAATC CAGAAACAGA GACCCATGGT CCCCAGAATA
6551 CAGCCGCGAG TACTTTGAAC ACTGATGCAT CCTCGGTCAC AGGTCTTTCT
6601 GAGACTCCTG TGGGGGCAAG TATCAGCTCT GAAGTCCCTC TTCCAATGGC
6651 CATAACTTCT AGATCAGATG TTTCTGGCCT TACATCTGAG AGTACTGCTA
6701 ACCCGAGTTT AGGCACAGCC TCTTCAGCAG GGACCAAATT AACTAGGACA
6751 ATATCCCTGC CCACTTCAGA GTCTTTGGTT TCCTTTAGAA TGAACAAGGA
6801 TCCATGGAC7V GTGTCAATCC CTTTGGGGTC CCATCCAACT ACTAATACAG
6851 AAACAAGCAT CCCAGTAAAC AGCGCAGGTC CACCTGGCTT GTCCACAGTA
6901 GCATCAGATG TAATTGACAC ACCTTCAGAT GGGGCTGAGA GTATTCCCAC
6951 TGTCTCCTTT TCCCCCTCCC CTGATACTGA AGTGACAACT ATCTCACATT
7001 TCCCAGAAAA GACAACTCAT TCATTTAGAA CCATTTCATC TCTCACTCAT
7051 GAGTTGACTT CAAGAGTGAC ACCTATTCCT GGGGATTGGA TGAGTTCAGC
7101 TATGTCTACA AAGCCCACAG GAGCCAGTCC Ci:CCATTACA CTGGGAGAGA
7151 GAAGGACAAT CACCTCTGCT GCTCCAACCA CTTCCCCCAT AGTTCTCACT
7201 GCTAGTTTCA CAGAGACCAG CACAGTTTCA CTGGATAATG AAACTACAGT
7251 AAAAACCTCA GATATCCTTG ACGCACGGAA AACAAATGAG CTCCCCTCAG
7301 ATAGCAGTTC TTCTTCTGAT CTGATCAACA CCTCCATAGC TTCTTCAACT
7351 ATGGATGTCA CTAAAACAGC CTCCATCAGT CCCACTAGCA TCTCAGGAAT
7401 GACAGCAAGT TCCTCCCCAT CTCTCTTCTC TTCAGATAGA CCCCAGGTTC
7451 CCACATCTAC 7UVCAGAGACA AATACAGCCA CCTCTCCATC TGTTTCCAGT
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)

7501 TVACACCTATT CTCTTGATGG GGGCTCCAAT GTGGGTGGCA CTCCATCCAC 

7551 TTTACCACCC TTTACAATGA CCCACCCTGT CGAGACAAGC TCGGCCCTAT 

7601 TAGCCTGGTC TAGACCAGTA AGAACTTTCA GCACCATGGT CAGCACTGAC 

7651 ACTGCCTCCG GAGAAAATCC TACCTCTAGC AATTCTGTGG TGACTTCTGT 

7701 TCCAGCACCA GGTACATGGG CCAGTGTAGG CAGTACTACT GACTTACCTG 

7751 CCATGGGCTT TCTCAAGACA AGTCCTGCAG GAGAGGCACA CTCACTTCTA 

7801 GCATCAACTA TTGAACCAGC CACTGCCTTC ACTCCCCATC TCTCAGCAGC 

7851 AGTGGTCACT GGATCCAGTG CTACATCAGA AGCCAGTCTT CTCACTACGA 

7901 GTGAAAGCAA AGCCATTCAT TCTTCACCAC AGACCCCAAC TACACCCACC 

7951 TCTGGAGCAA ACTGGGAAAC TTCAGCTACT CCTGAGAGCC TTTTGGTAGT 

8001 CACTGAGACT TCAGACACAA CACTTACCTC AAAGATTTTG GTCACAGATA 

8051 CCATCTTGTT TTCAACTGTG TCCACGCCAC CTTCTAAATT TCCAAGTACG 

8101 GGGACTCTGT CTGGAGCTTC CTTCCCTACT TTACTCCCGG ACACTCCAGC 

8151 CATCCCTCTC ACTGCCACTG AGCCAACAAG TTCATTAGCT ACATCCTTTG 

8201 ATTCCACCCC ACTGGTGACT ATAGCTTCGG ATAGTCTTGG CACAGTCCCA 

8251 GAQACTACCC TGACCATbTC AGAQACCTCA AATGGTGATG CACTGGTTCT 

8301 TAAGACAGTA AGTAACCCAG ATAGGAGCAT CCCTGGAATC ACTATCCAAG 

8351 GAGTAACAGA AAGTCCACTC CATCCTTCTT CCACTTCCCC CTCTAAGATT 

8401 GTTGCTCCAC GGAATACAAC CTATGAAGGT TCGATCACAG TGGCACTTTC 

8451 TACTTTGCCT GCGGGAACTA CTGGTTCCCT TGTATTCAGT CA6AGTTCTG 

8501 AAAACTCAGA GACAACGGCT TTGGTAGACT CATCAGCTGG GCTTGAGAGG 

8551 GCATCTGTGA TGCCACTAAC CACAGGAAGC CAGGGTATGG CTAGCTCTGG 

8601 AGGAATCAGA ACTGGGTCCA CTCACTCAAC TGGAACCAAA ACATTTTCTT 

8651 CTCTCCCTCT GACCATGAAC CCAGGTGAGG TTACAGCCAT GTCTGAAATC 

8701 ACCACGAACA GACTGACAGC TACTCAATCA ACAGCACCCA AAGGGATACC 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



8751 TGTGAAGCCC ACCAGTGCTG AGTCAGGCCT CCTAACACCT GTCTCTGCCT
8801 CCTCAAGCCC ATCAAAGGCC TTTGCCTCAC TGACTACAGC TCCCCCATCA
8851 ACTTGGGGGA TCCCACAGTC TACCTTGACA TTTGAGTTTT CTGAGGTCCC
8901 AAGTTTGGAT ACTAAGTCCX3 CTTCTTTACC AACTCCTGGA CAGTCCCTGA
8951 ACACCATTCC AGACTCAGAT GCAAGCACAG CATCTTCCTC ACTGTCCAAG
9001 TCTCCAGAAA AAAACCCAAG GGCAAGGATG ATGACTTCCA CAAAGGCCAT
9051 AAGTGCAAGC TCATTTCAAT CAACAGGTTT TACTQAT^CC CCTGAGGGAT
9101 CTGCCTCCCC TTCTATGGCA GGGCATGAAC CCAGAGTCCC CACTTCAGGA
9151 ACAGGGGACC CTAGATATGC CTCAGAGAGC ATGTCTTATC CAGACCCAAG
9201 CAAGGCATCA TCAGCTATGA CATCGACCTC TCTTGCATCA AAACTCACAA
9251 CTCTCTTCAG CACAGGTCT^ GCAGCAAGGT CTGGTTCTAG TTCCTCTCCC
9301 ATAAGCCTAT CCACTGAGAA AGAAACAAGC TTCCTTTCCC CCACTGCATC
9351 CACCTCCAGA AAGACTTCAC TATTTCTTGG GCCTTCCATG GCAAGGCAGC
9401 CCAACATATT GGTGCATCTT OVGACTTCAG CTCTGACACT TTCTCCAACA
9451 TCCACTCTAA ATATGTCCCA GGAGGAGCCT CCTGAGTTAA CCTCAAGCCA
9501 GACCATTGCA GAAGAAGAGG GAACAACAGC TGAAACACAG ACGTTAACCT
9551 TCACACCATC TGAGACCCCA ACATCCTTGT TACCTGTCTC TTCTCCCACA
9601 GAACCCACAG CCAGAAGAAA GAGTTCTCCA GAAACATGGG CAAGCTCTAT
9651 TtCAGTTCCT GCCAAGACCT CCTTGGTTGA AACAACTGAT GGAACGCTAG
9701 TGACCACCAT AAAGATCTCA AGCCAQGCAG CACAAGGAAA TTCCACGTGG
9751 CCTGCCCCAG CAGAGGAGAC GGGGACCAGT CCAGCAGGCA CATCCCCAGG
9801 AAGCCCAGAA GTGTCTACCA CTCTCAAAAT CATGAGCTCC AAGGAACCCA
9851 GCATCAGCCC AGAGATCAGG TCCACTGTGC GAAATTCTCC TTGGAAGACT
9901 CCAGAAACAA CTGTTCCCAT GGAGACCACA GTGGAACCAG TCACCCTTCA
9951 GTCCACAGCC CTAGGAAGTG GCAGCACCAG CATCTCTCAC CTGCCCACAG
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



10001 GAACCACATC ACCAACCAAG TCACCAACAG AAAATATGTT GGCTACAGAA 

10051 AGGGTCTCCC TCTCCCCATC CCCACCTGAG GCTTGGACCA ACCTTTATTC 

10101 TGGAACTCCA GGAGGGACCA GGCAGTCACT GGCCACAATG TCCTCTGTCT 

10151 CCCTAGAGTC ACCAACTGCT AGAAGCATCA CAGGGACTGG TCAGCAAAGC 

10201 AGTCCAGAAC TGGTTTCAAA GACAACTGGA ATGGAATTCT CTATGTGGCA 

10251 TGGCTCTACT GGAGGGACCA CAGGGGACAC ACATGTCTCT CTGAGCACAT 

10301 CTTCCAATAT CCTTGAAGAC CCTGTAACCA GCCCAAACTC TGTGAGCTCA 

10351 TTGACAGATA AATCCAAACA TAAAACCGAG ACATGGGTAA GCACCACAGC 

10401 CATTCCCTCC ACTGTCCTGA ATAATAAGAT AATGGCAGCT GAACAACAGA 

10451 CAAGTCGATC TGTGGATGAG GCTTATTCAT CAACTAGTTC TTGGTCAGAT 

10501 CAGACATCTG GGAGTGACAT CACCCTTGGT GCATCTCCTG ATGTCACAAA 

10551 CACATTATAC ATCACCTCCA CAGCACAAAC CACCTCACTA GTGTCTCTGC 

10601 CCTCTGGAGA CCAAGGCATT ACAAGCCTCA CCAATCCCTC AGGAGGAAAA 

10651 ACAAGCTCTG CGTCATCTGT CACATCTCCT TCAATAGGGC TTGAQACTCT 

10701 GAGGGCCAAT GTAAGTGCAG TGAAAAGTGA C ^TTGCC^CT ACTGCTGGGC 

10751 ATCTATCTCA GACTTCATCT CCTGCGGAAG TGAGCATCCT GGACGTAACC 

10801 ACAGCTCCTA CTCCAGGTAT CTCCACCACC ATCACCACCA TGGGAACCAA 

10851 CTCAATCTCA ACTACCACAC CCAACCCAGA AGTGGGTATG AGTACCATGG 

10901 ACAGCACCC^ GGCCACAGAG ')^(5GCACAA CTTCTACAC»' ACACCCT^

10951 ACCTGGTCTT CCACAGCTGC ATCAGATTCC TGGACTGTCA CAGACATGAC 

11001 TTCAAACTTG AAAGTTGC/^ GATCTCCTGG AACAATTTCC ACAATGCATA 

11051 CAACTTCATT CTTAGCCTCA AGCACTGAAT TAGACTCCAT GTCTACTCCC 

11101 CATGGCCGTA TAACTGTCAT TGGAACCAGC CTGGTCACTC CATCCTCTGA 

11151 TGCTTCAGCT GTAAAGACAG AGACCAGTAC AAGTGAAAGA ACATTGAGTC 

11201 CTTCAGACAC AACTGCATCT ACTCCCATCT CAACTTTTTC TCGTGTCCAG 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



11251 AGGATGAGCA TCTCAGTTCC TGACATTTTA AGTACAAGTT GGACTCCCAG
11301 TAGTACAGAA GCAGAAGATG TGCCTGTTTC AATGGTTTCT ACAGATCATG
11351 CTAGTACT^ GACTGACCCA AATACGCCCC TGTCCACTTT TCTGTTTGAT
11401 TCTCTGTCCA CTCTTGACTG GGACACTGGG AGATCTCTGT CATCAGCCAC
11451 AGCCACTACC TCAGCTCCTC AGGGGGCCAC AA.CTCCCCAG GAACTCACTT
11501 TGGAAACCAT GATCAGCCCA GCTACCTCAC AGTTGCCCTT CTCTATAGGG
11551 CACATTACAA GTGCAGTCAC ACCAGCTGCA ATGGCAAGGA GCTCTGGAGT
11601 TACTTTTTCA AGACCA&ATC CCACAAGCAA AAAGGCAGAG CAGACTTCCA
11651 CTCAGCTTCC CACCACCACT TCTGCACATC CAGGGCAGGT GCCCAGATCA
11701 GCAGCAACAA CTCTGGATGT GATCCCACAC ACAGCAAAAA CTCCAGATGC
11751 AACTTTTCAG AGACAAGGGC AGACAGCTCT TACAACAGAG GCAAGAGCTA
11801 CATCTGACTC CTGGAATGAG AAAGAAAAAT CAACCCCAAG TGCACCTTGG
11851 ATCACTGAGA TQATGAATTC TGTCTCAGAA GATACCATCA AGGAGGTTAC
11901 CAGCTCCTCC AGTGTATTAA AGGACCCTGA ATACGCTGGA CATAAACTTG
11951 GAATCTGGGA CGACTTCATC CCCAAGTTTG GAAAAGCAGC CCATATGAGA
12001 GAGTTGCCCC TTCTGAGTCC ACCACAGGAC AAW3AGGCAA TTCACCCTTC
12051 TACAAACACA GTAGAGACCA CAGGCTGGGT CACAAGTTCC QAACATGCTT
12101 CTCATTCCAC TATCCCAGCC CACTCAGCGT CATCCTW^CT CACATCTCCA
12151 GTGGTTACAA CCTCCACCAG isisi^CAAGCA ATiftGTTTCTA TGTCAACAAC
12201 CACATGGCCA GAGTCTACAA GGGCTAGAAC AGAGCCTAAT TCCTTCTTGA
12251 CTATTGAACT GAGGGACGTC AGCCCTTACA TGGACACCAG CTCAACCACA
12301 CAAACAAGTA TTATCTCTTC CCCAGGTTCC ACTGCGATCA CCAAGGGGCC
12351 TAGAACAGAA ATTACCTCCT CTAAGAGAAT ATCCAGCTCA TTCCTTGCCC
12401 AGTCTATGAG GTCGTCAGAC AGCCCCTCAG AAGCCATCAC CAGGCTGTCT
12451 AACTTTCCTG CCATGACAGA ATCTGGAGGA ATGATCCTTG CTATGCAAAC
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



12501 AAGTCCACCT GGCGCTACAT CACTAAGTGC ACCTACTTTG GATACATCAG 

12551 CCACAGCCTC CTGGACAGGG ACTCCACTGG CTACGACTCA GAGATTTACA 

12601 TACTCAGAGA AGACCACTCT CTTTAGCAAA GGTCCTGAGG ATACATCACA 

12651 GCCAAGCCCT CCCTCTGTGG AAGAAACCAG CTCTTCCTCT TCCCTGGTAC 

12701 CTATCCATGC TACAACCTCG CCTTCCAATA TTTTGTTGAC ATCACAAGGG 

12751 CACAGTCCCT CCTCTACTCC ACCTGTGACC TCAGTTTTCT TGTCTGAGAC 

12801 CTCTGGCCT6 GGGAAGACCA CAGACATGTC GAGGATAAGC TTGGAACCT6 

12851 GCACAAGTTT ACCTCCCAAT TTGAGCAGTA CAGCAGGTGA GGCGTTATCC 

12901 ACTTATGAAG CCTCCAGAGA TACAAAGGCA ATTCATCATT CTGCAGACAC 

12951 AGCAGTGACG AATATGGAGG CAACCAGTTC TGAATATTCT CCTATCCCAG 

13001 GCCATACAAA GCCATCCAAA GCCACATCTC CATTGGTTAC CTCCCACATC 

13051 ATGGGGGACA TCACTTCTTC CACATCAGTA TTTGGCTCCT CCGAGACCAC 

13101 AGAGATTGAG ACAGTGTCCT CTGTGAACCA GGGACTTCAG GAGAGAAGCA 



13151 CATCCCAGGT GGCCAGCTCT GCTACAGAGA CAAGCACTGT CATTACCCAT 

13201 GTGTCTAGTG GTGATGCTAC TACTCATGTC ACCAAGACAC AAGCCACTTT 

13251 CTCTAGCGGA ACATCCATCT CAAGCCCTCA TCAGTTTATA ACTTCTACCA 

13301 ACACATTTAC AGATGTGAGC ACCAACCCCT CCACCTCTCT GATAATGACA 

13351 GAATCTTCAG GAGTGACCAT CACCACCCAA ACAGGTCCTA CTGGAGCTGC 

13401 AACACAGGGT CCATATCTCT TGGACACATC AACCATGCCT TACTTGACAG 

13451 AGACTCCATT AGCTGTGACT CCAGATTTTA TGCAATCAGA GAAGACCACT 

13501 CTCATAAGCA AAGGTCCCAA GGATGTGACC TGGACAAGCC CTCCCTCTGT 

13551 GGCAGAAACC AGCTATCCCT CTTCCCTGAC ACCTTTCTTG GTCACAACCA 

13601 TACCTCCTGC CACTTCCACG TTACAAGGGC T^CATACATC CTCTCCTGTT 

13651 TCTGCGACTT CAGTTCTTAC CTCTGGACTG GTGAAGACCA CAGATATGTT 

13701 GAACACAAGC ATGGAACCTG TGACCAATTC ACCTCAAAAT TTGAACAATC 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



13751 CATCAAATGA C3ATACTGGCC ACTTTGGCAG CCACCACAGA TATAGAGACT
13801 ATTCATCCTT CCATAAACAA AGCAGTGACC AATATGGGGA CTGCCAGTTC
13851 AGCACATGTA CTGCATTCCA CTCTCCCAGT CAGCTCAGAA CCATCTACAG
13901 CCACATCTCC AATGGTTCCT GCCTCCAGCA TGGGGGACGC TCTTGCTTCT
13951 ATATCAATAC CTGGTTCTGA GACCACAGAC ATTGAGGGAG AGCCAACATC
14001 CTCCCTGACT GCTGGACGAA AAGAGAACAG CACCCTCCAG GAGATGAACT
14051 CAACTACAGA GTCAAACATC ATCCTCTCCA ATGTGTCTGT GGGGGCTATT
14101 ACTGAAGCCA CAAAAATGGA AGTCCCCTCT TTTGATGCAA CATTCATACC
14151 AACTCCTGCT CAGTCAACAA AGTTCCCAGA TATTTTCTCA GTAGCCAGCA
14201 GTAGACTTTC AAACTCTCCT CCCATGACAA TATCTACCCA CATGACCACC
14251 ACCCAGACAG GGTCTTCTGG AGCTACATCA AAGATTCCAC TTGCCTTAGA
14301 CACATCAACC TTGGAAACCT CAGCAGGGAC TCCATCAGTG GTGACTGAGG
14351 GGTTTGCCCA CTCAAAAATA ACCACTGCAA TGAACAATGA TGTCAAGGAC
14401 GTGTCACAGA CAAACCCTCC CTTTCAGGAT GAAGCCAGCT CTCCCTCTTC
14451 TCAAGCACCT GTCCTTGTCA CAACCTTACC TTCTTCTGTT GCTTTCACAC
14501 CGCAATGGCA CAGTACCTCC * TCTCCTGTTT CTATGTCCTC AGTTCTTACT
14551 Tcrra^cTGG taaagaccgc aggcaaggtg gatacaagct tagaaacagt
14601 GACCAGTTCA CCTCAAAGTA TGAGCAACAC TTTGGATGAC ATATCGGTCA
14651 cttcagcagc caccacagat atagagacaa cgcatccttc cataaacaca
14701 gtagttacct^ atgtggggac caccggttca gcatttgaat cacattctac
14751 tgtctcagct tacccagagc CATCTAAAGT cacatctcca aatgttacca
14801 cctccaccat qgaagacacc acaatttccc gatcaatacc taaatcctct
14851 aagactacaa gaactgagac tgaqacaact tcctccctga ctcctaaact
14901 gagggagacc agcatctccc aggagatcac ctcgtccaca gagacaagca
14951 ctgttcctta caaagagctc actggtgcca ctaccgaggt atccaggaca
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



15001 GATGTCACTT CCTCTAGCAG TACATCCTTC CCTGGCCCTG ATCAGTCCAC 

15051 AGTGTCACTA GACATCTCCA CAGAAACCAA CACCAGGCTG TCTACCTCCC 

15101 CAATAATGAC A6AATCTGCA GAAATAACCA TCACCACCCA AACAGGTCCT 

15151 CATGGGGCTA CATCACAGGA TACTTTTACC ATGGACCCAT CAAATACAAC 

15201 CCCCCAGGCA GGQATCCACT CAGCTATGAC TCATGGATTT TCACAATTGG 

15251 ATGTGACCAC TCTTATQAGC AGAATTCCAC AGGATGTATC ATGGACAAGT 

15301 CCTCCCTCTG TGGATAAAAC CAGCTCCCCC TCTTCCTTTC TGTCCTCACC 

15351 TGCAATGACC ACACCTTCCC TGATTTCTTC TACCTTACCA GAGGATAAGC 

15401 TCTCCTCTCC TATGACTTCA CTTCTCACCT CTGGCCTAGT GAAGATTACA 

15451 GACATATTAC GTACAC6CTT GGAACCTGTG ACCAGCTCAC TTCeAAATTT 

15501 CAGCAGCACC TCAGATAAGA TACTGGCCAC TTCTAAAGAC AGTAAAGACA 

15551 CAAAGGAAAT TTTTCCTTCT ATAAACACAG AAGAGACCAA TGTGAAAGCC 

15601 AACAACTCTG GACATGAATC CCATTCCCCT GCACTQGCTG ACTCaGAGAC 

15651 ACCCAAAGCC ACAACTCAAA TGGTTATCAC CACCACTQTG GGAGATCO^G 

15701 CTCCTTCCAC ATCAATGCCA GTGCATGGTT CCTCTGAGAC TACAAACATT 

15751 AAGAGAGAGC CS^ACATATTT CTTGACTCCT A6ACTGAGAG AGACCAGTAC 

15801 CTCTCAGGAG TCCAGCTTTC CCACGGACAC AAGTTTTCTA CTTTCCAAAG 

15851 TCCCCACTGG TACTATTACT GAGGTCTCCA GTACAGGGGT CAACTPTTCT 

15901 AGCAAAATTT CCACCCCAGA CCS^TGATAAG TCX3W»GTGC CACCTGACAC 

15951 CTTCACAGGA GAGATCCCCA GGGTCTTCAC CTCCTCTATT AAGACAAAAT 



16001 CTGCAGRAAT GACGATCACC ACCCAAGCAA GTCCTCCTGA GTCTGCATC6 

16051 CACAGTACCC TTCCCTTGGA CACATC»ACC ACACTTTCCC AGGGAG6GAC 

16101 TCATTCAACT GTQACTC3«3G GATTCCCATA CTCASAGGTG ACCACTCTCA 

16151 TGGGCATGGG TCCTGGGAAT GTGTCATGGA TQACAACTCC CCCTGTGGAA 

16201 GAAACCAGCT CTGTGTCTTC CCTGATGTCT TCACCTGCCA TGACATCCCC 

16251 TTCTCCTGTT TCCTCCACAT CACCACAGAG CATCCCCTCC TCTCCTCTTC 

16301 CTGTGACTGC ACTTCCTACT TCTGTTCTGG TGACAACCAC AGATGTGTTG 

16351 GGCACAACAA GCCCAGAGTC TGTAACCAGT TCACCTCCAA ATTTGAGCAG 

16401 CATCACTCAT GAGAGACCGG CCACTTACAA AGACACTGCA CACACAGAAG 

16451 CCGCCAT6CA TCATTCCACA AACACCGCAG TGACCAATGT AGGGACTTCC 

16501 GGGTCTGQAC ATAAATCACA ATCCTCTGTC CTAGCTGACT CAGAGACATC 

16551 GAAAGCCACA CCTCTGATGA GTACCACCTC CACCCTGGGG GACACAAGTG 

16601 TTTCCACATC AACTCCTAAT ATCTCTCAGA CTAACCAAAT TCAAACAGAG 

16651 CCAACAGCAT CCCTGAGCCC TAGACTGAGG GAGAGCAGCA CGTCTGAGAA 

16701 GACCAGCTCA ACAACAGAGA CAAATACTGC CTTTTCTTAT GTGCCCACAG 

16751 GTGCTATTAC TCAGGCCTCC AGAACAGAAA TCTCCTCTAG CAGAACATCC 

16801 ATCTCAGACC TTGATCGGCC CACAATAGCA CCCGACATCT CCACAGGAAT 

16851 GATCACCAGG CTCTTCACCT CCCCCATCAT GACAAAATCT GCAGAAATGA 

16901 CCGTCACCAC TCAAACAACT ACTCCTGGGG CTACATCACA GGGTATCCTT 

16951 CCTTGGGACA CATCAACCAC ACTTTTCCAG GGAGGGACTC ATTCAACCCT 

17001 GTCTCAGGGA TTCCCACACT CAGAGATAAC CACTCTTCGG AGCAGAACCC 

17051 CTGGAGATGT GTCATGGATG ACAACTCCCC CTGTGGAAGA AACCAGCTCT 

17101 GGGTTTTCCC TGATGTCACC TTCCATGACA TCCCCTTCTC CTCTTTCCTC 

17151 CACATCACCA GAGAGCATCC CCTCCTCTCC TCTCCCTGTG ACTGCACTTC 

17201 TTACTTCTGT TCTGGTGACA ACCACCAATG TATTGGGCAC AACAAGCCCA 

17251 GAGACCGTAA CGAGTTCACC TCCAAATTTA AGCA6CCCCA CACAGGAGAG 

17301 ACTGACCACT TACAAAGACA CTGCGCACAC AGAAGCCATG CATGCTTCCA 

17351 TGGATACAAA CACTGCAGTG GCCAACGTCG GGACCTCCAT TTCTGGACAT 

17401 GAATCACAAT CTTCTGTCCC AGCTGATTCA CACACATCCA AAGCCACATC 

17451 TCCAATGGGT ATCACCTTCG CCATGGGGGA TACAAGTGTT TCTACATCAA 

17501 CTCCTGCCTT CTTTGAGACT AGAATTCAGA CTGAATCAAC ATCCTCTTTG 

17551 ATTCCTGGAT TAAGGGACAC CAGGACGTCT GAGGAGATCA ACACTGTGAC 

17601 AGAGACCAGC ACTGTCCTTT CAGAAGTGCC CACTACTACT ACTACTGAGG 
17651 TCTCCAGGAC AGAAGTTATC ACTTCCAGCA GAACAACCAT CTCAGGGCCT 

17701 GATCATTCCA AAATGTCACC CTACATCTCC ACAGAAACCA TCACCAGGCT 

17751 CTCCACTTTT CCTTTTGTAA CAGGATCCAC AGAAATGGCC ATCACCAACC 

17801 AAACAGGTCC TATAGGGACT ATCTCACAGG CTACCCTTAC CCTGGACACA 

17851 TC7VAGCACAG CTTCCTGGGA AGGGACTCAC TCACCTGTGA CTCAGAGATT 

17901 TCCACACTCA GAGGAGACCA CTACTATGAG CAGAAGTACT AAGGGCGTGT 

17951 CATGGCAAAG CCCTCCCTCT GTGGAAGAAA CCAGTTCTCC TTCTTCCCCA 

18001 GTGCCTTTAC CTGCAATAAC CTCACATTCA TCTCTTTATT CCGCAGTATC 

18051 AGGAAGTAGC CCCACTTCTG CTCTCCCTGT GACTTCCCTT CTCACCTCTG 

18101 GCAGGAGGAA GACCATAGAC ATGTTGGACA CACACTCAGA ACTTGTGACC 

18151 AGCTCCTTAC CAAGTGCAAG TAGCTTCTCA GGTGAGATAC TCACTTCTGA 

18201 AGCCTCCACA AATACAGAGA CAATTCACTT TTCAGAGAAC ACAGCAGAAA 

18251 CCAATATGGG GACCACCAAT TCTATGCATA AACTACATTC CTCTGTCTCA 

18301 ATCCACTCCC AGCCATCCGG ACACACACCT CCAAAGGTTA CTGGATCTAT 

18351 GATGGAGGAC GCTATTGTTT CCACATCAAC ACCTGGTTCT CCTGAGACTA 

18401 AAAATGTTGA CAGAGACTCA ACATCCCCTC TGACTCCTGA ACTGAAAGAG 

18451 GACAGCACCX3 CCCTGGTGAT GAACTCAACT ACAGAGTCAA ACACTGTTTT 

18501 CTCCAGTGTG TCCCTGGATG CTGCTACTGA GGTCTCCAGG GCAGAAGTCA 

18551 CCTACTATGA TCCTACATTC ATGCCAGCTT CTGCTCAGTC AACAAAGTCC 

18601 CCAGACATTT CACCTGAAGC CAGCAGCAGT CATTCTAACT CTCCTCCCTT 



18651 GACAATATCT ACACACAAGA CCATCGCCAC ACAAACAGGT CCTTCTGGGG 
18701 TGACATCTCT TGGCCAACTG ACCCTGGACA CATCAACCAT AGCCACCTCA 
18751 GCAGGAACTC CATCAGCCAG AACTCAGGAT TTTGTAGATT CAGAAACAAC
18801 CAGTGTCATG AACAATGATC TCAATGATGT GTTGAAGACA AGCCCTTTCT
18851 CTGCAGAAGA AGCCAACTCT CTCTCTTCTC AGGCACCTCT CCTTGTGACA
18901 ACCTCACCTT CTCCTGTAAC TTCCACATTG CAAGAGCACA GTACCTCCTC
18951 TCTTGTTTCT GTGACCTCAG TACCCACCCC TACACTGGCG AAGATCACAG
19001 ACATGGACAC AAACTTA6AA CCTGTGACTC GTTCACCTCA AAATTTAAGG
19051 AACACCTTGG CCACTTCAGA AGCCACCACA GATACACACA CAATGCATCC
19101 TTCTATAAAC ACAGCAATGG CCAATGTGGG GACCACCAGT TCACCAT^TG
19151 AATTCTATTT TACTGTCTCA CCTGACTCAG ACCCATATAA AGCCACATCC
19201 GCAGTAGTTA TCACTTCCAC CTCGGGGGAC TCAATAGTTT CCACATCAAT
19251 GCCTAGATCC TCTGCGATGA AAAAGATTGA GTCTGAGACA ACTTTCTCCC
19301 TGATATTTAG ACTGAGGGAG ACTAGCACCT CCCAGAAAAT TGGCTCATCC
19351 TCAGACACAA GCACGGTCTT TGACAAAGCA TTCACTGCTG CTACTACTGA
19401 GGTCTCCAGA ACAGAACTCA CCTCCTCTAG CAGAACATCC ATCCAAGGCA
19451 CTGAAAAGCC CACAATGTCA CCGGACACCT CCACAAGATC TGTCACCATG
19501 CTTTCTACTT TTGCTGGCCT GACAAAATCC GAAGAAAGGA CCATTGCCAC
19551 CCAAACAGGT CCTCATAGGG CGACATCACA GGGTACCCTT ACCTGGGACA
19601 CATCAATCAC AACCTCACAG GCAGGGACCC ACTCAGCTAT GACTCATGGA
19651 TTTTCACAAT TAGATTTGTC CACTCTTACG A6TAGAGTTC CTGAGTACAT
19701 ATCAGGGACA AGCCCACCCT CTGTGGAAAA AACCAGCTCT TCCTCTTCCC
19751 TTCTGTCTTT ACCAGCAATA ACCTCACCGT CCCCTGTACC TACTACATTA
19801 CCAGAAAGTA GGCCGTCTTC TCCTGTTCAT CTGACTTCAC TCCCCACCTC
19851 TGGCCTAGTG AAGACCACAG ATATGCTGGC ATCTGTGGCC AGTTTACCTC
19901 CAAACTTGGG CAGCACCTCA CATAAGATAC CGACTACTTC AGAAGACATT
19951 AAAGATACAG AGAAAATGTA TCCTTCCACA AACATAGCAG TAACCAATGT

20001 GGGGACCACC ACTTCTGAT^ AGGAATCTTA TTCGTCTGTC CCAGCCTACT 

20051 CAGAACCACC CAAA6TCACC TCTCCAATGG TTACGTCTTT CAACATAAGG 

20101 GACACCATTG TTTCCACATC CATGCCTGGC TCCTCTGAGA TTACAAGGAT 

20151 TGAGATGGAG TCAACATTCT CCGTGGCTCA TGGGCTGAAG GGAACCAGCA 

20201 CCTCCCAGGA CCCCATCGTA TCCACAGAGA AT^GTGCTGT CCTTCACAAG 

20251 TTGACCACTG GTGCTACTGA GACCTCTAGG ACAGAAGTTG CCTCTTCTAG 

20301 AAGAACATCC ATTCCAGGCC CTGATCATTC CACAGAGTCA CCAGACATCT

20351 CCACTGAAGT GATCCCCAGC CTGCCTATCT CCCTTGGCAT TACAGAATCT 

20401 TCAAATATGA CCATCATCAC TCGAACAGGT CCTCCTCTTG GCTCTACATC 

20451 ACAGGGCACA TTTACCTTGG ACACACCAAC TACATCCTCC AGGGCAGGAA 

20501 CACACTCGAT GGCGACTCAG GAATTTCCAC ACTCAGAAAT GACCACTGTC 

20551 ATGAACAAGG ACCCTGAGAT TCTATCATGG ACAATCCCTC CTTCTATAGA 

20601 GAAAACCAGC TTCTCCTCTT CCCTGATGCC TTCACCAGCC ATGACTTCAC 

20651 CTCCTGTTTC CTCAACATTA CCAAAGACCA TTCACACCAC TCCTTCTCCT 

20701 ATGACCTCAC TGCTCACCCC TAGCCTAGTG ATGACCACAG ACACATTGGG 

20751 CACAAGCCCA GAACCTACAA CCAGTTCACC TCCAAATTTG AGCAGTACCT 

20801 CACATGTGAT ACTGACAACA GATGAAGACA CCACAGCTAT AGAAGCCATG 

20851 CAAGCACAGC AGCGACTAAT GTGGAAACCA CCTGTTCTGG 

20901 ACATGGGTCA CAATCCTCTG TCCTAACTGA CTCAGAAAAA ACCAAGGCCA 

20951 CAGCTCCAAT GGATACCACC TCCACCATGG GGCATACAAC TGTTTCCACA 

21001 TCAATGTCTG TTTCCTCTGA GACTACAAAA ATTAAGAGAG AGTCAACATA 

21051 TTCCTTGACT CCTGGACTGA GAGAGACCAG CATTTCCCAA AATGCCAGCT 

21101 TTTCCACTGA CACAAGTATT GTTCTTTCAG AAGTCCCCAC TGGTACTACT

21151 GCTGAGGTCT CCAGGACAGA AGTCACCTCC TCTGGTAGAA CATCCATCCC 

21201 TGGCCCTTCT CAGTCCACAG TTTTGCCAGA AATATCCACA AGAACAATGA 
21251 CAAGGCTCTT TGCCTCGCCC ACCATGACAG AATCAGCAGA AATGACCATC
21301 CCCACTCA7UV CAGGTCCTTC TGGGTCTACC TCACAGGATA CCCTTACCTT
21351 GGACACATCC ACCACAAAGT CCGAGGCT^ GACTCATTCA ACTTTGACTC
21401 AGAGATTTCC ACACTCAGAG ATGACCACTC TCATGAGCAG AGGTCCTGGA
21451 GATATGTCAT GGCAAAGCTC TCCCTCTCTG GAAAATCCCA GCTCTCTCCC
21501 TTCCCTGCTG TCTTTACCTG CCACAACCTC ACCTCCTCCC ATTTCCTCCA
21551 CATTACCAGT GACTATCTCC TCCTCTCCTC TTCCTGTGAC TTCACTTCTC
21601 ACCTCTAGCC CGGTAACGAC CACAGACATG TTACACACAA GCCCAGAACT
21651 TGTAACCAGT TCACCTCCAA AGCTGAGCCA CACTTCAGAT GAGAGACTGA
21701 CCACTGGCAA GGACACCACA AATACAGAAG CTGTGCATCC TTCCACAAAC
21751 ACAGCAGCGT CCAATGTGGA GATTCCCAGC TTTGGACATG AATCCCCTTC
21801 CTCTGCCTTA GCTGACTCAG AGACATCCAA AGCCACATCA CCAATGTTTA
21851 TTACCTCCAC CCAGGAGGAT ACAACTGTTG CCATATCAAC CCCTCACTTC
21901 TTGGAGACTA GCAGAATTCA GAAAGAGTCA ATTTCCTCCC TGAGCCCTAA
21951 ATTGAGGGAG ACAGGCAGTT CTGTGGAGAC AAGCTCAGCC ATAGAGACAA
22001 GTGCTGTCCT TTCTGAAGTG TCCATTGGTG CTACTACTGA GATCTCCAGG
22051 ACAGAAGTCA CCTCCTCTAG CAGAACATCC ATCTCTGGTT CTGCTGAGTC
22101 CACAAtGtTG CCAGAAATAT CCACCA AAAAATCATT AAGTTCCCTA
22151 CTTCCCCCAT CCTGGCAGAA TCATCAGAAA TGACCATCAA GACCCAAACA
22201 AGTCCTCCTG GGTCTACATC AGAGAGTACC TTTACATTAG ACACATCAAC
22251 CACTCCCTCC TTGGTAATAA CCCATTCGAC TATGACTCAG AGATTGCCAC
22301 ACTCAGAGAT AACCACTCTT GTGAGTAGAG GTGCTGGGGA TGTGCCACX3G
22351 CCCAGCTCTC TCCCTGTGGA AGAAACAAGC CCTCCATCTT CCCAGCTGTC
22401 TTTATCTGCC ATGATCTCAC CTTCTCCTGT TTCTTCCACA TTACCAGCAA
22451 GTAGCCACTC CTCTTCTGCT TCTGTGACTT CACCTCTCAC ACCAGGCCAA
22501 GTGAAGACTA CTGAGGTGTT GGACGCAAGT GCAGAACCTG AAACCAGTTC
22551 ACCTCCAAGT TTGAGCAGCA CCTCAGTTGA AATACTGGCC ACCTCTGAAG
22601 TCACCACAGA TACGGAGAAA ATTCATCCTT TCCCAAACAC GGCAGTAACC
22651 AAAGTTGGAA CTTCCA6TTC TGGACATGAA TCCCCTTCCT CTGTCCTACC
22701 TGACTCAGAG ACAACCAAAG CCACATCGGC AATGGGTACC ATCTCCATTA
22751 TGGGGGATAC AAGTGTTTCT ACATTAACTC CTGCCTTATC TAACACTAGG
22801 AAAATTCAGT CAGAGCCAGC TTCCTCACTG ACCACCAGAT TGAGGGAGAC
22851 CAGCACCTCT GAAGAGACCA GCTTAGCCAC AGAAGCAAAC ACTGTTCTTT
22901 CTAAAGTGTC CACTGGTGCT ACTACTGAGG TCTCCAGGAC AGAAGCCATC
22951 TCCTTTAGCA GAACATCCAT GTCAGGCCCT GAGCAGTCCA CAATGTCACA
23001 AGACATCTCC ATAGGAACCA TCCCCAGGAT TTCTGCCTCC TCTGTCCTGA
23051 CAGAATCTGC AAAAATGACC ATCACAACCC AAACAGGTCC TTCGGAGTCT
23101 ACACTAGAAA GTACCCTTAA TTTGAACACA GCAACCACAC CCTCTTGGGT
23151 GGAAACCCAC TCTATAGTAA TTCAGGGATT TCCACACCCA GAGATGACCA
23201 CTTCCATGGG CAGAGGTCCT GGAGGTGTGT CATGGCCTAG CCCTCCCTTT
23251 GTGAAAGAAA CCAGCCCTCC ATCCTCCCCG CTGTCTTTAC CTGCCGTGAC
23301 CTCACCTCAT CCTGTTTCCA CCACATTCCT AGCACATATC CCCCCCTCTC
23351 CCCTTCCTGT GAGTTCAGTT CTCACCTCTG GGCGGGCGAC AACCACAGAT
23401 ATCTTGGGTA CAAGCACAGA ACCTGGAACC AGTTCATCTT CAAGTTTGAG
23451 CACCACCTCC CATGAGAGAC TGACCACTTA CAAAGACACT GCACATACAG
23501 AAGCCGTGCA TCCTTCCACA AACACAGGAG GGACCAATGT GGCAACCACC
23551 AGCTCTGGAT ATAAATCACA GTCCTCTGTC CTAGCTGACT CATCTCCAAT
23601 GTGTACCACC TCCACCATGG GGGATACAAG TGTTCTCACA TCAACTCCTG
23651 CCTTCCTTGA GACTAGGAGG ATTCAGACAG AGCTAGCTTC CTCCCTGACC
23701 CCTGGATTGA GGGAGTCCAG TGGCTCTGAA GGGACCAGCT CAGGCACCAA
23751 GATGAGCACT GTCCTCTCTA AAGTGCCCAC TGGTGCTACT ACTGAGATCT 
23801 CCAAGGAAGA CGTCACCTCC ATCCCAGGTC CCGCTCAATC CACAATATCA 
23851 CCA6ACATCT CCACAAGAAC CGTCAGCTGG TTCTCTACAT CCCCTGTCAT 
23901 GACAGAATCA GCAGAAATAA CCATGAACAC CCATACAAGT CCTTTAGGGG 
23951 CCACAACACA AGGCACCAGT ACTTTGGCCA CGTCAAGCAC AACCTCTTTG 
24001 ACAATGACAC ACTCAACTAT ATCTCAAGQA TTTTCACACT CACAGATGAG 
24051 CACTCTTATG AGGAGGGGTC CTQAGGATGT ATCATGGATG AGCCCTCCCC 
24101 TTCTGGAAAA AACTAGACCT TCCTTTTCTC TGATGTCTTC ACCAGCCACA 
24151 ACTTCACCTT CTCCTGTTTC CTCCy^CATTA CCA6AGAGCA TCTCTTCCTC 
24201 TCCTCTTCCT GTGACTTCAC TCCTCACGTC TGGCTTGGCA AAAACTACAG 
24251 ATATGTTGCA CAAAAGCTCA GAACCTGTAA CCAACTCACC TGCAAATTTG 
24301 AGCAGCACCT CAGTTGAAAT ACTGGCCACC TCTGAAGTCA CCACAGATAC 
24351 AGAGAAAACT CATCCTTCTT CAAACAGAAC AGTGACCGAT GTGGGGACCT 
24401 CCAGTTCTGG ACATGAATCC ACTTCCTTTG TCCTAGCTGA CTCACAGACA 
24451 TCCAAAGTCA CATCTCCAAT GGTTATTACC TCCACCATGG AGGATACGAG 
24501 TGTCTCCACA TCSVACTCCTG GCTTTTTTGA GACTAGCAGA ATTCAGACAG 
24551 AACCAACATC CTCCCTGACC CTTGGACTGA GAAAGACX»G CAGCTCTGAG 
24601 GGGACCA6CT TAGCCACftGA GATGAGCACT GTCCTTTCTG GAGTGCCCAC 
24651 TGGTGCCACT GCTGAAGTCT CCAGGACAGA AGTCACCTCC TCTAGCAGAA 
24701 CATCCATCTC AGGCTTTGCT CAGCTCACAG TGTCACCAGA GACTTCCACA 
24751 GAAACCATCA CCAGACTCCC TACCTCCAGC ATAATQACAG AATCAGCAGA 
24801 AATGATGATC AAGACACAAA CAGATCCTCC TGGGTCTACA CCAGAGAGTA 
24851 CTCATACTGT GGACATATCA ACAACACCCA ACTGGGTAGA AACCCACTCG 
24901 ACTGTGACTC AGAGATTTTC ACACTCAGAG ATGACCACTC TTGTGAGCAG 
24951 AAGCCCTGGT GATATGTTAT GGCCTAGTCA ATCCTCTGTG GAAGAAACCA 
25001 GCTCTGCCTC TTCCCTGCTG TCTCTGCCTG CCACGACCTC ACCTTCTCCT 

25051 GTTTCCTCTA CATTAGTAGA GGATTTCCCT TCCGCTTCTC TTCCTGTGAC 

25101 TTCTCTTCTC ACCCCTGGCC TGGTGATAAC CACAGACAGG ATGGGCATAA

25151 GCAGAGAACC TGGAACCAGT TCCACTTCAA ATTTGAGCAG CACCTCCCAT 

25201 GAGAQACTGA CCACTTTGQA AGACACTGTA GATACAGAAG ACATGCAGCC 

25251 TTCCACACAC ACAGCAGTGA CCAACGTGAG GACCTCCATT TCTGGACATG 

25301 AATCACAATC TTCTGTCCTA TCTGACTCAG AGACACCCAA AGCCACATTCT 

25351 CCAATGGGTA CCACCTACAC CATGGGGGAA ACGAGTGTTT CCATATCCAC 

25401 TTCTGACTTC TTTQAGACCA GCAGAATTCA GATAGAACCA ACATCCTCCC 

25451 TGACTTCTGG ATTQAGGGAG ACCAGCAGCT CTGAGAGGAT CS^GCTCAGCC 

25501 ACAGAGGGAA GCACTGTCCT TTCTGAAGTG CCCAGTGGTG CTACCACTQA 

25551 GGTCTCCAGG ACAGAAGTGA TATCCTCTAG GGGAACATCC ATGTCAGGGC 

25601 CTGATCAGTT CACCATATCA CCAQACATCT CTACT6AAGC GATCACCaGG 

25651 CTTTCTACTT CCCCCATTAT GACAGAATCA GCAGAAAGTG CCATCACTAT 

25701 TGAGACAGGT TCTCCTGGGG CTACATC3M3A GGGTACCCTC ACCTTGGACA 

25751 CCTCAACAAC AACCTTTTGG TCAGGGACCC ACTCAACTGC ATCTCCAGQA 

25801 TTTTCACACT CAGAGATGAC CACTCTTATG AGTAGAACTC CTGGAQATGT 

25851 GCCATGGCCG AGCCTTCCCT CTeTGGAAGA AGCCAGCTCT GTCTCTTCCT 

25901 CACTGTCTTC ACCTGCCATG ACCTCAACTT CTTTTTTCTC CGCATTACCA 

25951 GAGAGCATCT CCTCCTCTCC TCATCCTGTG ACTGCACTTC TCACCCTTGG 

26001 CCCAGTGAAG ACCACAGACA TGTTGCGCAC AAGCTCAGAA CCTGAAACCA 

26051 GTTCACCTCC AAATTTGAGC AGCACCTCAG CTGAAATATT AGCCACGTCT 

26101 GAAGTCACCA AAGATAGAGA GAAAATTCAT CCCTCCTCAA ACACACCTGT 

26151 AGTCAATGTA GGGACTGTGA TTTATAAACA TCTATCCCCT TCCTCTGTTT 

26201 TGGCTGACTT AGTGACAACA AAACCCACAT CTCCAATGGC TACCACCTCC 



WO 2004/045553

PCT/US2003/037041




26251 ACTCTGGGGA ATACAAGTGT TTCCACATCA ACTCCTGCCT TCCCAGAAAC 

26301 TATGATGACA CAGCCAACTT CCTCCCTGAC TTCTGGATTA AGGGAGATCA 

26351 GTACCTCTCA AGAGACCA6C TCAGCAACAG AGAGAAGTGC TTCTCTTTCT 

26401 GGAATGCCCA CTGGTGCTAC TACTAAGGTC TCCAGAACAG AAGCCCTCTC 

26451 CTTAGGCAGA ACATCCACCC CAGGTCCTGC TCAATCCACA ATATC3VCCAG 

26501 AAATCTCCAC GGAAACCATC ACTAGAATTT CTACTCCCCT CACCACGACA 

26551 GGATCAGCAG AAATGACCAT CACCCCCAAA ACAGGTCATT CTGGGGCATC 

26601 CTCACAAGGT ACCTTTACCT TGGACACATC AAGCAGAGCC TCCTGGCCAG 

26651 GAACTCACTC AGCTGCAACT CACAGATCTC CACACTCAGG GATGACCACT 

26701 CCTATGAGCA GAGGTCCTGA GGATGTGTCA TGGCCAAGCC GCCCATCAGT 

26751 GGAAAAAACT AGCCCTCCAT CTTCCCTGGT GTCTTTATCT GCAGTAACCT 

26801 CACCTTCGCC ACTTTATTCC ACACCATCTG AGAGTAGCCA CTCATCTCCT 

26851 CTCCGGGTGA CTTCTCTTTT CACCCCTGTC ATGATGAAGA CCACAGACAT 

26901 GTTGGACACA AGCTTGGAAC CTGTGACCAC TTCACCTCCC AGTATGAATA 

26951 TCACCTCA6A TGAGAGTCTG GCCACTTCTA AAGCCACCAT GGAGACAGAG 

27001 GCAATTCAGC TTTCAGAAAA CACAGCTGTG ACTCAGATGG GCACCATCAG 

27051 CGCTAGACAA GAATTCTATT CCTCTTATCC AGGCCTCCCA GAGCCATCCA

27101 AAGTGACATC TCCAGTGGTC ACCTCTTCCA CCATAAAAGA CATTGTTTCT 

27151 ACAACCATAC CTGCTTCCTC TGAGATAACA AGAATTGAGA TGGAGTCAAC 

27201 ATCCACCCTG ACCCCCACAC CAAGGGAGAC CAGCACCTCC CAGGAGATCC 

27251 ACTCAGCCAC AAAGCCAAGC ACTGTTCCTT ACAAGGCACT CACTAGTGCC 

27301 ACGATTGAGG ACTCCATGAC ACAAGTCATG TCCTCTAGCA GAGGACCTAG 

27351 CCCTGATCAG TCCACAATGT CACAAGACAT ATCCAGTGAA GTGATCACCA 

27401 GGCTCTCTAC CTCCCCCATC AAGGCAGAAT CTACAGAAAT GACCATTACC 

27451 ACCCAAACAG GTTCTCCTGG GGCTACATCA AGGGGTACCC TTACCTTGGA 
27501 CACTTCAACA ACTTTTATGT CAGGGACCCA CTCAACTGCA TCTCAAGGAT 
27551 TTTCACACTC ACAGATGACC GCTCTTATGA GTAGAACTCC TGGAQATGTG 
27601 CCATGGCTAA GCCATCCCTC TGTGGAAGAA GCCAGCTCTG CCTCTTTCTC 
27651 ACTGTCTTCA CCTGTCATGA CCTCATCTTC TCCGGTTTCT TCCACATTAC 
27701 CAGACAGCAT CCACTCTTCT TCGCTTCCTG TGACATCACT TCTCACCTCA 
27751 GGGCTGGTGA AGACCACAGA GCTGTTGGGC ACAAGCTCAG AACCTGAAAC 
27801 CAGTTCACCC CCAAATTTGA GCAGCACCTC AGCTGAAATA CTGGCCACCA 
27851 CTGAAGTCAC TACAGATACA GAGAAACTGG AGATGACCAA TGTGGTAACC 
27901 TCAGGTTATA CACATGAATC TCCTTCCTCT GTCCTAGCTG ACTCAGTGAC 
27951 AACAAAGGCC ACATCTTCAA TGGGTATCAC CTACCCCACA GGAGATACAA 
28001 ATGTTCTCAC ATCAACCCCT GCCTTCTCTG ACACCAGTAG QATTCAAACA 
28051 AAGTCAAAGC TCTCACTGAC TCCTGGGTTG ATGGAGACCA GCATCTCTGA 
28101 AGAGACCAGC TCTGCCACAG AAAAAAGCAC TGTCCTTTCT A6TGTGCCCA 
28151 CTGGTGCTAC TACTQAGGTC TCOMSGACAG AAGCCATCTC TTCTAGCAGA 
28201 ACATCCATCC CAGGCCCTGC TCAATCCACA ATGTCATCAG ACACCTCCAT 
28251 GGAAACCATC ACTAGAATTT CTACCCCCCT CACAAGGAAA GAATCAACAG 
28301 ACATGGCCAT CACCCCCAAA ACAGGTCCTT CTGGGGCTAC CTCX3CAGGGT 
28351 ACCOTTACCT TGGAGTGATG AAGCACAGCC TCCTGGCCAG GAACTCACTC 
28401 AGCTACAACT CAGAGATTTC CACAGTCAGT GGTGACAACT CCTATGAGCA 
28451 GAGGTCCTGA GGATGTGTCA TGGCCAAGCC CGCTGTCTGT GQAAAAAAAC 
28501 AGCCCTCCAT CTTCCCTGGT ATCTTCATCT TCAGTAACCT CACCTTCGCC 
28551 ACTTTATTCC ACACCa.TCTG GGAGTAGCCA CTCCTCTCCT GTCCCTGTCA 
28601 CTTCTCTTTT eACCTCTATC ATGATOAAGG CCACAGACAT GTTGQATGCA 
28651 AGTTTGGAAC CTQAGACCAC TTCAGCTCCC AATATGAATA TCACGTCAGA 
28701 TGAGAGTCTG GCCACTTCTA AAGCCACCAC GGAGAC3M3AG GCAATTCAC6 

28751 TTTTTGAAAA TACAGCAGCG TCCCATGTGG AAACCACCAG TGCTACAGAG 

28801 GAACTCTATT CCTCTTCCCC AGGCTTCTCA GAGCCAACAA AAGTGATATC 

28851 TCCAGTGGTC ACCTCTTCCT CTATAAGAGA CAACATGGTT TCCACAACAA 

28901 TGCCTGGCTC CTCTGGCATT ACAAGGATTG AGATAGAGTC AATGTCATCT 

28951 CTGACCCCTG GACTGAGGGA GACCAGAACC TCCCAGGACA TCACCTCATC 

29001 CACAGAGACA AGCACTGTCC TTTACAAGAT GTCCTCTGGT GCCACTCCTG 

29051 AGGTCTCCAG GACAGAAGTT ATGCCCTCTA GCAGAACATC CATTCCTGGC 

29101 CCTGCTCAGT CCACAATGTC ACTAGACATC TCCGATGAAG TTGTCACCAG 

29151 GCTGTCTACC TCTCCCATCA TGACAGAATC TGCAGAAATA ACCATCACCA 

29201 CCCAAACAGG TTATTCTCTG GCTACATCCC AGGTTACCCT TCCCTTGGGC 

29251 ACCTCAATGA CCTTTTTGTC AGGGACCCAC TCAACTATGT CTCAAGGACT 

29301 TTCACACTCA GAGATGACCA ATCTTATGAG CAGGGGTCCT GAAAGTCTGT 

29351 CATGGACGAG CCCTCX3CTTT GTGGAAACAA CTAGATCTTC CTCTTCTCTG 

29401 ACATCATTAC CTCTCACQAC CTCACTTTCT CCTGTGTCCT CCACATTACT 

29451 AGACAGTAGC CCCTCCTCTC CTCTTCCTGT GACTTCACTT ATCCTCCCAG 

29501 GCCTGGTGAA GACTACAQAA GTGTTGGATA CAAGCTCAGA GCCTAAAACC 

29551 AGTTCATCTC CAAATTTGAG CAGCACCTCA GTTGAAATAC CGGCCACCTC 

29601 TGAAATCATG ACAGATACAG AGAAAATTCA TCCTT€CTCA AACAGAGCGG

29651 TGGCCAAAGT GAGGACCTCC AGTTCTGTTC ATGAATCTCA TTCCTCTGTC 

29701 CTAGCTGACT CAGAAACAAC CATAACCATA CCTTCAATGG GTATCACCTC 

29751 CGCTGTGGAC GATACCACTG TTTTCACATC AAATCCTGCC TTCTCTGAGA 

29801 CTAGGAGGAT TCCGACAGAG CCAACATTCT CATTGACTCC TGGATTCAGG 

29851 GAGACTAGCA CCTCTGAAGA GACCACCTCA ATCACAGAAA CAAGTGCAGT

29901 CCTTTATGGA GTGCCCACTA GTGCTACTAC TGAAGTCTCC ATGACAGAAA 

29951 TCATGTCCTC TAATAGAACA CACATCCCTG ACTCTGATCA GTCCACGATG 

30001 TCTCCAGACA TCATCACTGA AGTGATCACC AGGCTCTCTT CCTCATCCAT 

30051 GATGTCAGAA TCAJ^CACAAA TGACCATCAC CACCCAAAAA AGTTCTCCTG 

30101 GGGCTACAGC ACAGAGTACT CTTACCTTGG CCACAAGAAC AGCCCCCTTG 

30151 GCAAGGACCC ACTCAACTGT TCCTCCTAGA TTTTTACACT CAGAGATGAC

30201 AACTCTTATG AGTAGGAGTC CTGAAAATCC ATCATGGAAG AGCTCTCCCT 

30251 TTGTGGAAAA AACTAGCTCT TCATCTTCTC TGTTGTCCTT ACCTGTCACG 

30301 ACCTCACCTT CTGTTTCTTC CACATTACCG CAGAGTATCC CTTCCTCCTC 

30351 TTTTTCTGTG ACTTCACTCC TCACCCCAGG CATGGTG7UVG ACTACAGACA 

30401 CAAGCACAGA ACCTGGAACC AGTTTATCTC CAAATCTGAG TGGCACCTCA 

30451 GTTGAAATAC TGGCTGCCTC TGAAGTCACC ACAGATACAG AGAAAATTCA 

30501 TCCTTCTTCA AGCATGGCAG TGACCAATGT GGGT^CCACC AGTTCTGGAC 

30551 ATGAACTATA TTCCTCTGTT TCAATCCACT CGGAGCCATC CAAGGCTACA 

30601 TACCCAGTGG GTACTCCCTC TTCCATGGCT GAAACCTCTA TTTCCACATC 

30651 AATGCCTGCT AATTTTGAGA CCACAGGATT TGAGGCTTGAG CCATTTTCTC 

30701 ATTTGACTTC TGGATTTAGG AAGACAAACA TGTCCCTGGA CACCAGCTCA 

30751 GTCACACCAA CAAATACACC TTCTTCTCCT GGGTCCACTC ACCTTTTACA 

30801 GAGTTCCAAG ACTGATTTCA CCTCTTCTGC AAAAACATCA TCCCCAGACT 
30851 GGCCTCCAGC CTCACA^STAT ACTGAAATTC CAGTGGACAT AATCACCCCC

30901 TTTAATGCTT CTCCATCTAT TACGGAGTCC ACTGGGATAA CCTCCTTCCC 

30951 AGAATCCAGG TTTACTATGT CTGTAACAGA AAGTACTCAT CATCTGAGTA 

31001 CAGATTTGCT GCCTTCAGCT GAGACTATTT CCACTGGCAC AGTGATGCCT 

31051 TCTCTATCAG AGGCCATGAC TTCATTTGCC ACGACTGGAG TTCCACGAGC 

31101 CATCTCAGGT TCAGGTAGTC CATTCTCTAG GACAGAGTCA GGCCCTGGGG

31151 ATGCTACTCT GTCCACCATT GCAGAGAGCC TGCCTTCATC CACTCCTGTG 

31201 CCATTCTCCT CTTCAACCTT CACTACCACT GATTCTTCAA CCATCCCAGC 
31251 CCTCCATQAG ATAACTTCCT CTTCAGCTAC CCCATATAGA GTGGACACCA
31301 GTCTTGGGAC AGAGAGCAGC ACTACTGAAG GACGCTTGGT TATGGTCAGT
31351 ACTTTGGACA CTTCAAGCCA ACCAGGCAGG ACATCTTCAA CACCCATTTT
31401 GGATACCAGA ATGACAGAGA GCX3TTGAGCT GGGAACAGTG AC3VAGTGCTT
31451 ATCAAGTTCC TTCACTCTCA ACACGGTTGA CAAGAACTGA TGGCATTATG
31501 GAACACATCA CAAAAATACC CAATGAAGCA GCACACAGAG GTACCATAAG
31551 ACCAGTCAAA GGCCCTCAGA CATCCACTTC GCCTGCCAGT CCTAAAGGAC
31601 TACACACAGG AGGGACAAAA AGAATGGAGA CCACCACCAC AGCTTTGAAG
31651 ACCACCACCA CAGCTTTGAA GACCACTTCC AGAGCCACCT TGACCACCAG
31701 TGTCTATACT CCCACTTTGG GAACACTGAC TCCCCTCAAT GCATCAAGGC
31751 AAATGGCCAG CACAATCCTC ACAGAAATGA TGATCACAAC CCCATATGTT
31801 TTCCCTGATG TTCCAGAAAC GACATCCTCA TTGGCTACCA GCCTGGGAGC
31851 AGAAACCAGC ACAGCTCTTC CCAGGACAAC CCCATCTGTT CTCAATAGAG
31901 AATCAGAGAC CACAGCCTCA CTGGTCTCTC GTTCTGGGGC AGAGAGAAGT
31951 CCGGTTATTC AA?^CTCTAGA TGTTTCTTCT AGTGAGCCAG ATACAACAGC
32001 TTCATGGGTT ATCCATCCTG CAGAGACCAT CCCAACTGTT TCCAAGACAA
32051 CCCCCAATTT TTTCCACAGT GAATTAGACA CTGTATCTTC CACAGCCACC
32101 AGTCATGGGG CAGACGTCAG CTCAGCCATT CGAAGAAftTA TCTCACCTAG
32151 TGAACTAGAT GCACTGACCC CACTGGTCAC TATTTCGGGG ACAGATACTA
32201 GTACAACATT CCCAACACTG ACTAAGTCCC CACATGAAAC AGAGACAAGA
32251 ACCACATGGC TCACTCATCC TGCAGAGACC AGCTCAACTA TTCCCAGAAC
32301 AATCCCCAAT TTTTCTCATC ATGAATCAGA TGCCACACCT TCAATAGCCA
32351 CCAGTCCTGG GGCAGAAACC AGTTCAGCTA TTCCAATTAT GACTGTCTCA
32401 CCTGGTGCAG AAGATCTGGT GACCTCACAG GTCACTAGTT CTGGGACAGA
32451 CAGAAATATG ACTATTCCAA CTTTGACTCT TTCTCCTGGT GAACCAAAGA
32501 CGATAGCCTC ATTAGTCACC CATCCTGAAG CACAGACAAG TTCGGCCATT
32551 CCAACTTCAA CTATCTCGCC TGCTGTATCA CGGTTGGTGA CCTCAATGGT
32601 CACCAGTTTG GCGGCT^GA CAAGTACAAC TAATCGAGCT CTGACAAACT
32651 CCCCTGGTGA ACCAGCTACA ACAGTTTCAT TGGTCACGCA TCCTGCACA6
32701 ACCAGCCCAA CAGTTCCCTG GACAACTTCC ATTTTTTTCC ATAGTAAATC
32751 AGACACCACA CCTTCAATGA CCACCAGTCA TGGGGCAGAA TCCAGTTCAG
32801 CTGTTCCAAC TCCAACTGTT TCAACTGAGG TACCAGGAGT AGTGACCCCT
32851 TTGGTCACCA GTTCTAGGGC AGTGATCAGT ACAACTATTC CAATTCTGAC
32901 TCTTTCTCCT GGTGAACCAG AGACCACACC TTCAATGGCC ACCAGTCATG
32951 GGGAAGAAGC CAGTTCTGCT ATTCCT^CTC C2\ACTGTTTC ACCTGGGGTA
33001 CCAGGAGTGG TGACCTCTCT GGTCACTAGT TCTAGGGCAG TGACTAGTAC
33051 AACTATTCCA ATTCTGACTT TTTCTCTTGG TGAACCAGAG ACCACACCTT
33101 CAATGGCCAC CAGTCATGGG ACAGAAGCTG GCTCAGCTGT TCCAACTGTT
33151 TTACCTGAGG TACCAGGAAT GGTGACCTCT CTGGTTGCTA GTTCTAGGGC
33201 AGTAACCAGT ACAACTCTTC CAACTCTGAC TCTTTCTCCT GGTGAACCAG
33251 AGACCACACC TTCAATGGCC ACCAGTCATG GGGCAGAAGC CAGCTCAACT
33301 GTTCCAACTG TTTCACCTGA GGTACCAGGA 6TGGTGACCT CTCTGGTCAC
33351 TAGTTCTAGT GGAGTAAACA GTACAAGTAT TCCAACTCTG ATTCTTTCTC
33401 CTGGTGAACT AGAAACCACA CCTTCAATGG CCACCAGTCA TGGGGCAGAA
33451 GCCAGCTCAG CTGTTCCAAC TCCAACTGTT TCACCTGGGG TATCA6GAGT
33501 GGTGACCCCT CTGGTCACTA GTTCCAGGGC AGTGACCAGT ACAACTATTC
33551 CAATTCTAAC TCTTTCTTCT AGTGAGCCAG AGACCACACC TTCAATGGCC
33601 ACCAGTCATG GGGTAGAAGC CAGCTCAGCT GTTCTAACTG TTTCACCTGA
33651 GGTACCAGGA ATGGTQACCT CTCTGGTCAC TAGTTCTAGA GCAGTAACCA
33701 GTACAACTAT TCCAACTCTG ACTATTTCTT CTGATGAACC AGAGACCACA
33751 ACTTCATTGG TCACCCATTC TGAGGCAAAG ATGATTTCAG CCATTCCAAC
33801 TTTAGCTGTC TCCCCTACTG TACAAGGGCT GGTGACTTCS^ CTGGTCACTA
33851 GTTCTGGGTC AGAGACCAGT GCGTTTTCAA ATCTAACTGT TGCCTCAAGT
33901 CAACCAGAGA CCATAGACTC ATGGGTCGCT CATCCTGGGA CAGAAGCAAG
33951 TTCTGTTGTT CCAACTTTQA CT6TCTCCAC TGGTQAGCCG TTTACAAATA
34001 TCTCATTGGT CACCCATCCT GCAGAGAGTA GCTCAACTCT TCCCAGGACA
34051 ACCTCAAGGT TTTCCCACAG TGAATTAGAC ACTATGCCTT CTACAGTCAC
34101 CAGTCCTGAG GCaGAATCCA GCTCAGCCAT TTCAACTACT ATTTCACCTG
34151 GTATACCAGG TGTGCTGACA TCACTGGTCA CTAGCTCTGG GAGAGACATC
34201 AGTGCAACTT TTCCAACAGT GCCTGAGTCC CCACATGAAT CAGAGGCAAC
34251 AGCCTCATGG GTTACTCATC CTGCAGTCAC CAGCACAACA GTTCCCAGGA
34301 CAACCCCTAA TTATTCTCAT AGTGAACCA6 ACACCACACC ATCAATAGCC
34351 ACCAGTCCTG GGGCAGAAGC CACTTCAGAT TTTCCAACAA TAACT6TCTC
34401 ACCTGATGTA CCAGATATGG TAACCTCACA GGTCACTAGT TCTGGGACAG
34451 ACACCAGTAT AACTATTCCA ACTCTGACTC TTTCTTCTGG TGAGCCAGAG
34501 ACCACAACCT CATTTATCAC CTATTCTGAG ACACACACAA GTTCAGCCAT
34551 TCCAACTCTC CCTGTCTCCC CTGGTGCATC AAAGATGCTG ACCTCACTGG
34601 T:ChTCi>&TXC..J!GGI3AC]^C AGCACTACAA CTTTCCCAAC ACTQACGQAG
34651 ACCCCATATG AACCAGAGAC AACAGCCATA CAGCTCATTC ATCCTGCAGA
34701 QACCAACACA ATGGTTCCCA AGACAACTCC CAACTTTTCC CATAGTAAGT
34751 CAGACACCAC ACTCCCA6TA GCCATCACCA GTCCTGGGCC AGAAGCC3V6T
34801 TCAGCTGTTT CAACGACAAC TATCTCACCT GATATGTCAG ATCTGGTGAC
34851 CTCACTGGTC CCTAGTTCTG GQACAGACaC CAGTA(aACC TTCCCAACAT
34901 TQAGTGAGAC CCCATATGAA CCAGAGACTA CAGTCACGTG GCTCACTCAT
34951 CCTGCAGAAA CCAGCACAAC G6TTTCTGGG ACAATTCCCA ACTTTTCCCA
35001 T2W3GGGATCA GACACTGCAC CCTCAATGGT CACCAGTCCT GGAGTAGACA 
35051 CGAGGTCAGG TGTTCCAACT ACAACCATCC CACCCAGTAT ACCAGGGGTA 
35101 GTCACCTCAC AGGTCACTAG TTCTGCAACA GACACTAGTA CAGCTATTCC 
35151 AACTTTGACT CCTTCTCCTG GTGAACCAGA QACCACAGCC TCATCAGCTA 
35201 CCCATCCTGG GACACAGACT GGCTTCACTG TTCCAATTCG GACTGTTCCC 
35251 TCTAGTGAGC CAGATACAAT GGCTTCCTGG GTCACTCATC CTCCACAGAC 
35301 CAGCACACCT GTTTCCAGAA CAACCTCCAG TTTTTCCCAT AGTAGTCCAG 
35351 ATGCCACACC TGTAATGGCC ACCAGTCCTA GGACAGAAGC CAGTTCAGCT 
35401 GTACTGACAA CAATCTCACC TGGTGCACCA GAGATGGTGA CTTCACAGAT 
35451 CACTAGTTCT GGGGCAGCAA CCAGTACAAC TGTTCCAACT TTGACTCATT 
35501 CTCCTGGTAT GCCAGAGACC ACAGCCTTAT TGAGCACCCA TCCCAGAACA 
35551 GGGACAAGTA AAACATTTCC TGCTTCAACT GTGTTTCCTC AAGTATCAGA 
35601 GACCACAGCC TCACTCACCA TTAQACCTGG TGCAGAGACT AGCACAGCTC 
35651 TCCCAACTCA GACAACATCC TCTCTCTTCA CCCTACTTGT AACTGGAACC 
35701 AGCAGAGTTG ATCTAAGTCC AACTGCTTCA CCTGGTGTTT CTGCAAAAAC 
35751 AGCCCCACTT TCCACCCATC CAGGGACAGA GACCAGCACA ATGATTCCAA 
35801 CTTCAACTCT TTCCCTTGGT TTACTAQASA CTACAGGCTT ACTGGCCACC 
35851 AGCTCTTCAG CAGAGACGAG CACGAGTACT CTAACTCTGA CTGTTTCCCC 
35901 TGCTGTCTCT G«3CTTTCCA GTGCCTCTAT AACAACTGAT AAGCCCCAAA 
35951 CTGTQACCTC CTGGAACACA GAAACCTCAC CATCTGTAAC TTCAGTTGGA 
36001 CCCCCAGAAT TTTCCAGGAC TGTCACAGGC ACCACTATGA CCTTGATACC
36051 ATCAGAGATG CCAACACCAC CTAAAACCAG TCATGGAGAA GGAGTGAGTC 
36101 CAACCACTAT CTTGAGAACT ACAATGGTTG AAGCCACTAA TTTAGCTACC 
36151 ACAGGTTCCA GTCCCACTGT GGCCAAGACA ACAACCACCT TCAATACACT 
36201 GGCTGQAAGC CTCTTTACTC CTCTGACCAC ACCTGGGATG TCCACCTTGG 
36251 CCTCTGAGAG TGTGACCTCA AGAACAAGTT ATAACCATCG GTCCTGGATC
36301 TCCACCACCA GCAGTTATAA CCGTCGGTAC TGGACCCCTG CCACCAGCAC
36351 TCCAGTGACT TCTACATTCT CCCCAGGGAT TTCCACATCC TCCATCCCCA
36401 GCTCCACAGC AGCCACAGTC CCATTCATGG TGCCATTCAC CCTCAACTTC
36451 ACCATCACCA ACCTGCAGTA CGAGQAGGAC ATGCGGCACC CTGGTTCCAG
36501 GAAGTTCAAC GCCACAGAGA GAGAACTGCA GGGTCTGCTC AAACCCTTGT
36551 TCAGGAATAG CAGTCTGGAA TACCTCTATT CAGGCTGCAG ACTAGCCTCA
36601 CTCAGGCCAG AGAAGGATAG CTCAGCCATG GCAGTGGATG CCATCTGCAC
36651 ACATCGCCCT GACCCTGAAG ACCTCGGACT GGACAGAGAG CGACTGTACT
36701 GGGAGCTGAG CAATCTGACA AATGGCATCC AGGAGCTGGG CCCCTACACC
36751 CTGGACCGGA ACAGTCTCTA TGTCAATGGT TTCACCCATC GAAGCTCTAT
36801 GCCCACCACC AGCACTCCTG GGACCTCCAC AGTGGATGTG GGAACCTCAG
36851 GGACTCCATC CTCCAGCCqC AGCCCCACGG CTGCTGGCCC TCTCCTGATG
36901 CCGTTCACCC TCAACTTCAC CATCACCAAC CTGCAGTACG AGGAGGACAT
36951 GCGTCGCACT GGCTCCAGGA AGTTCAACAC CATGGAGAGT GTCCTGCAGG
37001 GTCTGCTCAA GCCCTTGTTC AAGAACACCA GTGTTGGCCC TCTGTACTCT
37051 GGCTGCAGAT TGACCTTGCT CAGGCCCGAG AAAGATGGGG CAGCCACTGG
37101 AGTCGATGCC ATCTGCACCC ACCeCt^ CCCCAiliAASC CCTOGACTC^
37151 ACAGGGAGCA GCTGTACTGG GAGCTAAGCA AACTGACCAA TGACATTGAA
37201 GAGCTGGGCC CCTACACCCT GGACAGGAAC AGTCTCTATG TCAATGGTTT
37251 CACCCATCAG AGCTCTGTGT CCACCACCAG CACTCCTGGG ACCTCCACAG
37301 TGGATCTCAG AACCTCAGGG ACTCCATCCT CCCTCTCCAG CCCCACAATT
37351 ATGGCTGCTG GCCCTCTCCT GGTACCATTC ACCCTCAACT TCACCATCAC
37401 CAACCTGCAG TATGGGGAGG ACATGGGTCA CCCTGGCTCC AGGAAGTTCA
37451 ACACCACAGA GAGGGTCCTG CAGGGTCTGC TTGGTCCCAT ATTCAAGAAC
37501 ACCAGTGTTG GCCCTCTGTA CTCTGGCTGC AGACTGACCT CTCTCAGGTC
37551 TGAGAAGGAT GGAGCAGCCA CTGGAGTGGA TGCCATCTGC ATCCATCATC
37601 TTGACCCCAA AAGCCCTGGA CTCAACAGAG AGCGGCTGTA CTGGGAGCTG
37651 AGCCAACTGA CCAATGGCAT CAAAGAGCTG GGCCCCTACA CCCTGGACAG
37701 GAACAGTCTC TATGTCAATG GTTTCACCCA TCGGACCTCT GTGCCCACCA
37751 CCAGCACTCC TGGGACCTCC ACAGTGGACC TTGGAACCTC AGGGACTCCA
37801 TTCTCCCTCC CAAGCCCCGC AACTGCTGGC CCTCTCCTGG TGCTGTTCAC
37851 CCTCAACTTC ACCATCACCA ACCTGAAGTA TGAGGAGGAC ATGCATCGCC
37901 CTGGCTCCAG GAAGTTCAAC ACCACTGAGA GGGTCCTGCA GACTCTGCTT
37951 GGTCCTATGT TCAAGAACAC CAGTGTTGGC CTTCTGTACT CTGGCTGCAG
38001 ACTQACCTTG CTCAGGTCCG AGAAGGATGG AGCAGCCACT GGAGTGGATG
38051 CCATCTGCAC CCACCGTCTT GACCCCAAAA GCCCTGGACT GGACAGAGAG
38101 CAGCTATACT GGGAGCTGAG CCAGCTGACC AATGGCATCA AAGAGCTGGG
38151 CCCCTACACC CTGGACAGGA ACAGTCTCTA TGTCAATGGT TTCACCCATT
38201 GGATCCCTGT GCCCACCAGC AGCACTCCTG GGACCTCCAC AGTGGACCTT
38251 GGGTCAGGGA CTCCATCCTC CCTCCCCAGC CCCACAGCTG CTGGCCCTCT
38301 CCTGGTGCCA TTCACCCTCA ACTTCACCAT CACCAACCTG CAGTACGAGG
38351 AGGACATGCA TCACCCAGGC TCCAGGJ^ TCAACACCAC GQAGCGGGTC
38401 CTGCAGGGTC TGCTTGGTCC CATGTTCAAG AACACCAGTG TCGGCCTTCT
38451 GTACTCTGGC TGCAGACTGA CCTTGCTCAG GTCCGAGAAG GATGGAGCAG
38501 CCACTGGAGT GGATGCCATC TGCACCCACC GTCTTGACCC CAAAAGCCCT
38551 GGAGTGGACA GGGAGCAGCT ATACTGGGAG CTGAGCCAGC TGACCAATGG
38601 CATCAAAGAG CTGGGTCCCT ACACCCTGGA CAGAAACAGT CTCTATGTCA
38651 ATGGTTTCAC CCATCAGACC TCTGCGCCCA ACACCAGCAC TCCTGGGACC
38701 TCCACAGTGG ACCTTGGGAC CTCAGGGACT CCATCCTCCC TCCCCAGCCC

38751 TACATCNGCT GGCCCTCTCC TGGTNCCNTT CACCCTCAAC TTCACCATCA 

38801 CCAACCTGCA GTACGAGGAG GACATGCGGC ACCCNGGNTC CAGQAAGTTC 

38851 AACACCACNG AGAGGGTNCT GCAGGGTCTG CTNAAGCCCC TNTTCAAGAG

38901 CACCAGTGTT GGCCCTCTGT ACTCTGGCTG CAGACTGACC TTGCTCAGGT 

38951 CCGAGAAGGA TGGAGCA6CC ACTGGAGTGG ATGCCATCTG CACCCACCGT 

39001 CTTGACCCCA AAAGCCCTGG AGTGGACAGG GAGCAGCTAT ACTGGGAGCT 

39051 GAGCCAGCTG ACCAATGGCA TCAT^GAGCT GGGTCCCTAC ACCCTGGACA 

39101 GAAACAGTCT CTATGTCAAT GGTTTCACCC ATCAGACCTC TGCGCCCAAC 

39151 ACCAGCACTC CTGGGACCTC CACAGTGGAC CTTGGGACCT CAGGGACTCC 

39201 ATCCTCCCTC CCCAGCCCTA CATCTGCTGG CCCTCTCCTG GTGCCATTCA 

39251 CCCTCAACTT CACCATCACC AACCTGCAGT ACGAGGAGGA CATGCATCAC 

39301 CCAGGCTCCA GGAAGTTCAA CACCACGGAG CGGGTCCTGC AGGGTCTGCT 

39351 TGGTCCCATG TTCAAGAACA CCAGTGTCGG CCTTCTGTAC TCTGGCTGCA 

39401 GACTGACCTT GCTCAGGCCT GAGAAGAATG GGGCAGCCAC TGGAATGGAT 

39451 GCCATCTGCA GCCACCGTCT TGACCCCAAA AGCCCTGGAC TCAACAGAGA 

39501 GCAGCTGTAC TGGGAGCTGA GCCAGCTGAC CCATGGCATC AAAGAGCTGG 

39551 GCCCCTACAC CCTGGACAGG AACAGTCTCT ATGTCAATGG TTTCACCCAT 

39601 CGGAGCTCTG TGGCCCCCAC CAGCACTCCT GGGACCTCCA CAGTCGACCT 

39651 TGGGACCTCA GGGACTCCAT CCTCCCTCCC CAGCCCCACA ACAGCTGTTC 

39701 CTCTCCTGGT GCCGTTCACC CTCAACTTTA CCATCACCAA TCTGCAGTAT 

39751 GGGGAGGACA TGCGTCACCC TGGCTCCAGG AAGTTCAACA CCACAGAGAG 

39801 GGTCCTGCAG GGTCTGCTTG GTCCCTTGTT CAAGAACTCC AGTGTCGGCC

39851 CTCTGTACTC TGGCTGCAGA CTGATCTCTC TCAGGTCTGA GAAGGATGGG 

39901 GCAGCCACTG GAGTGGATGC CATCTGCACC CACCACCTTA ACCCTCAAAG

39951 CCCTGGACTG GACAGGGAGC AGCTGTACTG GCAGCTGAGC CAGATGACCA 

Table 4 (continued) 



Human cDNA of CA125
(SEQ ID NO: 4)



40001 ATGGCATCAA AGAGCTGGGC CCCTACACCC TGGACCGGAA CAGTCTCTAC 

40051 GTCAATGGTT TCACCCATCG GAGCTCTGGG CTCACCACCA GCACTCCTTG 

40101 GACTTCCACA GTTGACCTTG GAACCTCAGG GACTCCATCC CCCGTCCCCA 

40151 GCCCCACAAC TGCTGGCCCT CTCCTGGTGC CATTCACCCT CAACTTCACC 

40201 ATCACCAACC TGCAGTATGA GGAGGACATG CATCGCCCTG GATCTAGGAA 

40251 GTTCAACACC ACAGAGAGGG TCCTGCAGGG TCTGCTTAGT CCCATTTTCA 

40301 AGAACTCCAG TGTTGGCCCT CTGTACTCTG GCTGCAGACT GACCTCTCTC 

40351 AGGCCCGAGA AGGATGGGGC AGCAACTGGA ATGGATGCTG TCTGCCTCTA 

40401 CCACCCTAAT CCCAAAAGAC CTGGACTGGA CAGAGAGCAG CTGTACTGGG 

40451 AGCTAAGCCA GCTGACCCAC AACATCACTG AGCTGGGCCC CTACAGCCTG

40501 GACAGGGACA GTCTCTATGT CAATGGTTTC ACCCATCAGA ACTCTGTGCC 

40551 CACCACCAGT ACTCCTGGGA CCTCCACAGT GTACTGGGCA ACCACTGGGA 

40601 CTCCATCCTC CTTCCCCGGC CACACAGAGC CTGGCCCTCT CCTGATACCA 

40651 TTCACTTTCA ACTTTACCAT CACCAACCTG CATTATGAGG AAAACATGCA

40701 ACACCCTGGT TCCAGGAAGT TCAACACCAC GGAGAGGGTT CTGCAGGGTC 

40751 TGCTCAAGCC CTTGTTCAAG AACACCAGTG TTGGCCCTCT GTACTCTGGC 

40801 TGCAGACTGA CCTCTCTCAG GCCCGAGAAG GATGGGGCAG CAACTGGAAT 

40851 GGATGCTGTC TGCCTCTACC ACCCTAATCC CAAAAGACCT GGGCTGGACA

40901 GAGAGCAGCT GTACTGGGAG CTAAGCCAGC TGACCCACAA CATCACTGAG

40951 CTGGGCCCCT ACAGCCTGGA CAGGGACAGT CTCTATGTCA ATGGTTTCAC 

41001 CCATCAGAAC TCTGTGCCCA CCACCAGTAC TCCTGGGACC TCCACAGTGT

41051 ACTGGGCAAC CACTGGGACT CCATCCTCCT TCCCCGGCCA CACAGAGCCT 

41101 GGCCCTCTCC TGATACCATT CACTTTCAAC TTTACCATCA CCAACCTGCA

41151 TTATGAGGAA AACATGCAAC ACCCTGGTTC CAGGAAGTTC AACACCACGG 

41201 AGAGGGTTCT GCAGGGTCTG CTCAAGCCCT TGTTCAAGAA CACCAGTGTT
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Table 4 (continued) 



Human cDNA of CA125
(SEQ ID NO: 4)



41251 GGCCCTCTGT ACTCTGGCTG CAGACTGACC TTGCTCAGAC CTGAGAAGCA

41301 TGAGGCAGCC ACTGGAGTGG ACACCATCTG TACCCACCGC GTTGATCCCA

41351 TCGGACCTGG ACTGGACAGG GAGCGGCTAT ACTGGGAGCT GAGCCAGCTG

41401 ACCAACAGCA TTACCGAACT GGGACCCTAC ACCCTGGACA GGGACAGTCT

41451 CTATGTCAAT GGCTTCAACC CTCGGAGCTC TGTGCCAACC ACCAGCACTC

41501 CTGGGACCTC CACAGTGCAC CTGGCAACCT CTGGGACTCC ATCCTCCCTG

41551 CCTGGCCACA CAGCCCCTGT CCCTCTCTTG ATACCATTCA CCCTCAACTT

41601 TACCATCACC AACCTGCATT ATGAGGAAAA CATGCAACAC CCTGGTTCCA

41651 GGAAGTTCAA CACCACGGAG AGGGTTCTGC AGGGTCTGCT CAAGCCCTTG

41701 TTCAAGAACA CCAGTGTTGG CCCTCTGTAC TCTGGCTGCA GACTGACCTT

41751 GCTCAGACCT GAGAAGCATG AGGCAGCCAC TGGAGTGGAC ACCATCTGTA

41801 CCCACCGCGT TGATCCCATC GGACCTGGAC TGNACAGNGA GCNGCTNTAC

41851 TGGGAGCTNA GCCANCTGAC CAANNNCATC NNNGAGCTGG GNCCCTACAC

41901 CCTGGACAGG NACAGTCTCT ATGTCAATGG TTTCACCCAT CNGANCTCTG

41951 NGCCCACCAC CAGCACTCCT GGGACCTCCA CAGTGNACNT NGGNACCTCN

42001 GGGACTCCAT CCTCCNTCCC CNGCCNCACA TCTGCTGGCC CTCTCCTGGT

42051 GCCATTCACC CTCAACTTCA CCATCACCAA CCTGCAGTAC GAGGAGGACA

42101 TGCATCACCC AGGCTCCAGG AAGTTCAAGA CCACGGAGCG GGTCCTGCAG

42151 GGTCTGCTTG GTCCCATGTT CAAGAACACC AGTGTCGGCC TTCTGTACTC

42201 TGGCTGCAGA CTGACCTTGC TCAGGCCTGA GAAGAATGGG GCAGCCACTG

42251 GAATGGATGC CATCTGCAGC CACCGTCTTG ACCCCAAAAG CCCTGGACTC

42301 GACAGAGAGC AGCTGTACTG GGAGCTGAGC CAGCTGACCC ATGGCATCAA

42351 AGAGCTGGGC CCCTACACCC TGGACAGGAA CAGTCTCTAT GTCAATGGTT

42401 TCACCCATCG GAGCTCTGTG GCCCCCACCA GCACTCCTGG GACCTCCACA

42451 GTGGACCTTG GGACCTCAGG GACTCCATCC TCCCTCCCCA GCCCCACAAC
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Table 4 (continued)



Human cDNA of CA125 
(SEQ ID NO: 4)



42501 AGCTGTTCCT CTCCTGGTGC CGTTCACCCT CAACTTTACC ATCACCAATC 

42551 TGCAGTATGG GGAGGACATG CGTCACCCTG GCTCCAGGAA GTTCAACACC

42601 ACAGAGAGGG TCCTGCAGGG TCTGCTTGGT CCCTTGTTCA AGAACTCCAG 

42651 TGTCGGCCCT CTGTACTCTG GCTGCAGACT GATCTCTCTC AGGTCTGAGA 

42701 AGGATGGGGC AGCCACTGGA GTGGATGCCA TCTGCACCCA CCACCTTAAC 

42751 CCTCAAAGCC CTGGACTGGA CAGGGAGCAG CTGTACTGGC AGCTGAGCCA 

42801 GATGACCAAT GGCATCAAAG AGCTGGGCCC CTACACCCTG GACCGGAACA 

42851 GTCTCTACGT CAATGGTTTC ACCCATCGGA GCTCTGGGCT CACCACCAGC 

42901 ACTCCTTGGA CTTCCACAGT TGACCTTGGA ACCTCAGGGA CTCCATCCCC 

42951 CGTCCCCAGC CCCACAACTG CTGGCCCTCT CCTGGTGCCA TTCACCCTAA 

43001 ACTTCACCAT CACCAACCTG CAGTATGAGG AGGACATGCA TCGCCCTGGA 

43051 TCTAGGAAGT TCAACGCCAC AGAGAGGGTC CTGCAGGGTC TGCTTAGTCC

43101 CATATTCAAG AACTCCAGTG TTGGCCCTCT GTACTCTGGC TGCAGACTGA 

43151 CCTCTCTCAG GCCCGAGAAG GATGGGGCAG CAACTGGAAT GGATGCTGTC 

43201 TGCCTCTACC ACCCTAATCC CAAAAGACCT GGACTGGACA GAGAGCAGCT 

43251 GTACTGGGAG CTAAGCCAGC TGACCCACAA CATCACTGAG CTGGGCCCCT 

43301 ACAGCCTGGA CAGGGACAGT CTCTATGTCA ATGGTTTCAC CCATCAGAGC 

43351 TCTATGACGA CCACCAGAAC TCCTGATACC TCCACAATGC ACCTGGCAAC

43401 CTCGAGAACT CCAGCCTCCC TGTCTGGACC TACGACCGCC AGCCCTCTCC 

43451 TGGTGCTATT CACAATCAAC TGCACCATCA CCAACCTGCA GTACGAGGAG 

43501 GACATGCGTC GCACTGGCTC CAGGAAGTTC AACACCATGG AGAGTGTCCT 

43551 GCAGGGTCTG CTCAAGCCCT TGTTCAAGAA CACCAGTGTT GGCCCTCTGT 

43601 ACTCTGGCTG CAGATTGACC TTGCTCAGGC CCAAGAAAGA TGGGGCAGCC 

43651 ACTGGAGTGG ATGCCATCTG CACCCACCGC CTTGACCCCA AAAGCCCTGG 

43701 ACTCAACAGG GAGCAGCTGT ACTGGGAGCT AAGCAAACTG ACCAATGACA 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



43751 TTGAAGAGCT GGGCCCCTAC ACCCTGGACA GGAACAGTCT CTATGTCAAT

43801 GGTTTCACCC ATCAGAGCTC TGTGTCCACC ACCAGCACTC CTGGGACCTC

43851 CACAGTGGAT CTCAGAACCT CAGGGACTCC ATCCTCCCTC TCCAGCCCCA

43901 CAATTATGNC NNCTGNCCCT CTCCTGNTNC CNTTCACCNT CAACTTNACC

43951 ATCACCAACC TGCANTANGN GGANNACATG CNNCNCCCNG GNTCCAGGAA

44001 GTTCAACACC ACNGAGAGGG TCCTACAGGG TCTGCTCAGG CCCTTGTTCA

44051 AGAACACCAG TGTCAGCTCT CTGTACTCTG GTTGCAGACT GACCTTGCTC

44101 AGGCCTGAGA AGGATGGGGC AGCCACCAGA GTGGATGCTG CCTGCACCTA

44151 CCGCCCTGAT CCCAAAAGCC CTGGACTGGA CAGAGAGCAA CTATACTGGG

44201 AGCTGAGCCA GCTAACCCAC AGCATCACTG AGCTGGGACC CTACACCCTG

44251 GACAGGGTCA GTCTCTATGT CAATGGCTTC AACCCTCGGA GCTCTGTGCC

44301 AACCACCAGC ACTCCTGGGA CCTCCACAGT GCACCTGGCA ACCTCTGGGA

44351 CTCCATCCTC CCTGCCTGGC CACACANCNN CTGNCCCTCT CCTGNTNCCN

44401 TTCACCNTCA ACTTNACCAT CACCAACCTG CANTANGNGG ANNACATGCN

44451 NCNCCCNGGN TCCAGGAAGT TCAACACCAC NGAGAGGGTT CTGCAGGGTC

44501 TGCTCAAACC CTTGTTCAGG AATAGCAGTC TGGAATACCT CTATTCAGGC

44551 TGCAGACTAG CCTCACTCAG GCCAGAGAAG GATAGCTCAG CCATGGCAGT

44601 GGATGCCATC TGCACACATC GCCCTGACCC TGAAGACCTC GGACTGGACA

44651 GAGAGCGACT GTACTGGGAG CTGAGCAATC TGACAAATGG CATCCAGGAG

44701 CTGGGCCCCT ACACCCTGGA CCGGAACAGT CTCTACGTCA ATGGTTTCAC

44751 CCATCGGAGC TCTGGGCTCA CCACCAGCAC TCCTTGGACT TCCACAGTTG

44801 ACCTTGGAAC CTCAGGGACT CCATCCCCCG TCCCCAGCCC CACAACTGCT

44851 GGCCCTCTCC TGGTGCCATT CACCCTCAAC TTCACCATCA CCAACCTGCA

44901 GTATGAGGAG GACATGCATC GCCCTGGTTC CAGGAGGTTC AACACCACGG

44951 AGAGGGTTCT GCAGGGTCTG CTCACGCCCT TGTTCAAGAA CACCAGTGTT
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



45001 GGCCCTCTGT ACTCTGGCTG CAGACTGACC TTGCTCAGAC CTGAGAAGCA

45051 AGAGGCAGCC ACTGGAGTGG ACACCATCTG TACCCACCGC GTTGATCCCA

45101 TCGGACCTGG ACTGGACAGA GAGCGGCTAT ACTGGGAGCT GAGCCAGCTG

45151 ACCAACAGCA TCACAGAGCT GGGACCCTAC ACCCTGGATA GGGACAGTCT

45201 CTATGTCAAT GGCTTCAACC CTTGGAGCTC TGTGCCAACC ACCAGCACTC

45251 CTGGGACCTC CACAGTGCAC CTGGCAACCT CTGGGACTCC ATCCTCCCTG

45301 CCTGGCCACA CAGCCCCTGT CCCTCTCTTG ATACCATTCA CCCTCAACTT

45351 TACCATCACC GACCTGCATT ATGAAGAAAA CATGCAACAC CCTGGTTCCA

45401 GGAAGTTCAA CACCACGGAG AGGGTTCTGC AGGGTCTGCT CAAGCCCTTG

45451 TTCAAGAGCA CCAGCGTTGG CCCTCTGTAC TCTGGCTGCA GACTGACCTT

45501 GCTCAGACCT GAGAAACATG GGGCAGCCAC TGGAGTGGAC GCCATCTGCA

45551 CCCTCCGCCT TGATCCCACT GGTCCTGGAC TGGACAGAGA GCGGCTATAC

45601 TGGGAGCTGA GCCAGCTGAC CAACAGCGTT ACAGAGCTGG GCCCCTACAC

45651 CCTGGACAGG GACAGTCTCT ATGTCAATGG CTTCACCCAT CGGAGCTCTG

45701 TGCCAACCAC CAGTATTCCT GGGACCTCTG CAGTGCACCT GGAAACCTCT

45751 GGGACTCCAG CCTCCCTCCC TGGCCACACA GCCCCTGGCC CTCTCCTGGT

45801 GCCATTCACC CTCAACTTCA CTATCACCAA CCTGCAGTAT GAGGAGGACA

45851 TGCGTCACCC TGGTTCCAGG AAGTTCAGCA CCAGGGAGAG AGTGGTGCAG

45901 GGTCTGCTCA AGCCCTTGTT CAAGAACACC AGTGTCAGCT CTCTGTACTC

45951 TGGTTGCAGA CTGACCTTGC TCAGGCCTGA GAAGGATGGG GCAGCCACCA

46001 GAGTGGATGC TGTCTGCACC CATCGTCCTG ACCCCAAAAG CCCTGGACTG

46051 GACAGAGAGC GGCTGTACTG GAAGCTGAGC CAGCTGACCC ACGGCATCAC

46101 TGAGCTGGGC GCCTACACCC TGGACAGGCA CAGTCTCTAT GTCAATGGTT

46151 TCACCCATCA GAGCTCTATG ACGACCACCA GAACTCCTGA TACCTCCACA

46201 ATGCACCTGG CAACCTCGAG AACTCCAGCC TCCCTGTCTG GACCTACGAC
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Table 4 (continued) 

Human cDNA of CA125 
(SEQ ID NO: 4)

46251 CGCCAGCCCT CTCCTGGTGC TATTCACAAT TAACTTCACC ATCACTAACC 

46301 TGCGGTATGA GGAGAACATG CATCACCCTG GCTCTAGAAA GTTTAACACC

46351 ACGGAGAGAG TCCTTCAGGG TCTGCTCAGG CCTGTGTTCA AGAACACCAG 

46401 TGTTGGCCCT CTGTACTCTG GCTGCAGACT GACCACGCTC AGGCCCAAGA 

46451 AGGATGGGGC AGCCACCAAA GTGGATGCCA TCTGCACCTA CCGCCCTGAT 

46501 CCCAAAAGCC CTGGACTGGA CAGAGAGCAG CTATACTGGG AGCTGAGCCA

46551 GCTAACCCAC AGCATCACTG AGCTGGGCCC CTACACCCAG GACAGGGACA 

46601 GTCTCTATGT CAATGGCTTC ACCCATCGGA GCTCTGTGCC AACCACCAGT 

46651 ATTCCTGGGA CCTCTGCAGT GCACCTGGAA ACCTCTGGGA CTCCAGCCTC 

46701 CCTCCCTGGC CACACAGCCC CTGGCCCTCT CCTGGTGCCA TTCACCCTCA 

46751 ACTTCACTAT CACCAACCTG CAGTATGAGG AGGACATGCG TCACCCTGGT 

46801 TCCAGGAAGT TCAACACCAC GGAGAGAGTC CTGCAGGGTC TGCTCAAGCC

46851 CTTGTTCAAG AGCACCAGTG TTGGCCCTCT GTACTCTGGC TGCAGACTGA 

46901 CCTTGCTCAG GCCTGAAAAA CGTGGGGCAG CCACCGGCGT GGACACCATC 

46951 TGCACTCACC GCCTTGACCC TCTAAACCCA GGACTGGACA GAGAGCAGCT 

47001 ATACTGGGAG CTGAGCAAAC TGACCCGTGG CATCATCGAG CTGGGCCCCT 

47051 ACCTCCTGGA CAGAGGCAGT CTCTATGTCA ATGGTTTCAC CCATCGGACC 

47101 TCTGTGCCCA CCACCAGCAC TCCTGGGACC TCCACAGTGG ACCTTGGAAC 

47151 CTCAGGGACT CCATTCTCCC TCCCAAGCCC CGCANCNNCT GNCCCTCTCC 

47201 TGNTNCCNTT CACCNTCAAC TTNACCATCA CCAACCTGCA NTANGNGGAN

47251 NACATGCNNC NCCCNGGNTC CAGGAAGTTC AACACCACNG AGAGGGTCCT

47301 GCAGACTCTG CTTGGTCCTA TGTTCAAGAA CACCAGTGTT GGCCTTCTGT 

47351 ACTCTGGCTG CAGACTGACC TTGCTCAGGT CCGAGAAGGA TGGAGCAGCC

47401 ACTGGAGTGG ATGCCATCTG CACCCACCGT CTTGACCCCA AAAGCCCTGG 

47451 AGTGGACAGG GAGCAACTAT ACTGGGAGCT GAGCCAGCTG ACCAATGGCA
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



47501 TTAAAGAACT GGGCCCCTAC ACCCTGGACA GGAACAGTCT CTATGTCAAT

47551 ggtttcaccc attggatccc tgtgcccacc agcagcactc ctgggacctc

47601 CACAGTGGAC CTTGGGTCAG GGACTCCATC CTCCCTCCCC AGCCCCACAA

47651 CTGCTGGCCC TCTCCTGGTG CCGTTCACCC TCAACTTCAC CATCACCAAC

47701 CTGAAGTACG AGGAGGACAT GCATTGCCCT GGCTCCAGGA AGTTCAACAC 

47751 CACAGAGAGA GTCCTGCAGA GTCTGCTTGG TCCCATGTTC AAGAACACCA 

47801 GTGTTGGCCC TCTGTACTCT GGCTGCAGAC TGACCTTGCT CAGGTCCGAG 

47851 AAGGATGGAG CAGCCACTGG AGTGGATGCC ATCTGCACCC ACCGTCTTGA 

47901 CCCCAAAAGC CCTGGAGTGG ACAGGGAGCA GCTATACTGG GAGCTGAGCC

47951 AGCTGACCAA TGGCATCAAA GAGCTGGGTC CCTACACCCT GGACAGAAAC 

48001 AGTCTCTATG TCAATGGTTT CACCCATCAG ACCTCTGCGC CCAACACCAG

48051 CACTCCTGGG ACCTCCACAG TGGACCTTGG GACCTCAGGG ACTCCATCCT 

48101 CCCTCCCCAG CCCTACANCN NCTGNCCCTC TCCTGNTNCC NTTCACCNTC

48151 AACTTNACCA TCACCAACCT GCANTANGNG GANNACATGC NNCNCCCNGG

48201 NTCCAGGAAG TTCAACACCA CNGAGNGNGT NCTGCAGGGT CTGCTNNNNC 

48251 CCNTNTTCAA GAACNCCAGT GTNGGCCNTC TGTACTCTGG CTGCAGACTG

48301 ACCTNNCTCA GGNCNGAGAA GNATGGNGCA GCCACTGGAN TGGATGCCAT

48351 CTGCANCCAC CNNCNTNANC CCAAAAGNCC TGGACTGNAC AGNGAGCNGC

48401 TNTACTGGGA GCTNAGCCAN CTGACCAANN NCATCNNNGA GCTGGGNCCC

48451 TACACCCTGG ACAGGNACAG TCTCTATGTC AATGGTTTCA CCCATTGGAT

48501 CCCTGTGCCC ACCAGCAGCA CTCCTGGGAC CTCCACAGTG GACCTTGGGT 

48551 CAGGGACTCC ATCCTCCCTC CCCAGCCCCA CAACTGCTGG CCCTCTCCTG

48601 GTGCCGTTCA CCCTCAACTT CACCATCACC AACCTGAAGT ACGAGGAGGA

48651 CATGCATTGC CCTGGCTCCA GGAAGTTCAA CACCACAGAG AGAGTCCTGC 

48701 AGAGTCTGCT TGGTCCCATG TTCAAGAACA CCAGTGTTGG CCCTCTGTAC
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



48751 TCTGGCTGCA GACTGACCTC GCTCAGGTCC GAGAAGGATG GAGCAGCCAC 

48801 TGGAGTGGAT GCCATCTGCA CCCACCGTGT TGACCCCAAA AGCCCTGGAG 

48851 TGGACAGGGA GCAGCTATAC TGGGAGCTGA GCCAGCTGAC CAATGGCATC 

48901 AAAGAGCTGG GTCCCTACAC CCTGGACAGA AACAGTCTCT ATGTCAATGG 

48951 TTTCACCCAT CAGACCTCTG CGCCCAACAC CAGCACTCCT GGGACCTCCA 

49001 CAGTGNACNT NGGNACCTCN GGGACTCCAT CCTCCNTCCC CNGCCNCACA

49051 TCTGCTGGCC CTCTCCTGGT GCCATTCACC CTCAACTTCA CCATCACCAA 

49101 CCTGCAGTAC GAGGAGGACA TGCATCACCC AGGCTCCAGG AAGTTCAACA 

49151 CCACGGAGCG GGTCCTGCAG GGTCTGCTTG GTCCCATGTT CAAGAACACC

49201 AGTGTCGGCC TTCTGTACTC TGGCTGCAGA CTGACCTTGC TCAGGCCTGA 

49251 GAAGAATGGG GCAACCACTG GAATGGATGC CATCTGCACC CACCGTCTTG 

49301 ACCCCAAAAG CCCTGGACTG NACAGNGAGC NGCTNTACTG GGAGCTNAGC 

49351 CANCTGACCA ANNNCATCNN NGAGCTGGGN CCCTACACCC TGGACAGGNA 

49401 CAGTCTCTAT GTCAATGGTT TCACCCATCN GANCTCTGNG CCCACCACCA 

49451 GCACTCCTGG GACCTCCACA GTGNACNTNG GNACCTCNGG GACTCCATCC

49501 TCCNTCCCCN GCCNCACANC NNCTGNCCCT CTCCTGNTNC CNTTCACCNT 

49551 CAACTTNACC ATCACCAACC TGCANTANGN GGANNACATG CNNCNCCCNG

49601 GNTCCAGGAA GTTCAACACC ACNGAGAGGG TTCTGCAGGG TCTGCTCAAA

49651 CCCTTGTTCA GGAATAGCAG TCTGGAATAC CTCTATTCAG GCTGCAGACT

49701 AGCCTCACTC AGGCCAGAGA AGGATAGCTC AGCCATGGCA GTGGATGCCA 

49751 TCTGCACACA TCGCCCTGAC CCTGAAGACC TCGGACTGGA CAGAGAGCGA 

49801 CTGTACTGGG AGCTGAGCAA TCTGACAAAT GGCATCCAGG AGCTGGGCCC 

49851 CTACACCCTG GACCGGAACA GTCTCTATGT CAATGGTTTC ACCCATCGAA 

49901 GCTCTATGCC CACCACCAGC ACTCCTGGGA CCTCCACAGT GGATGTGGGA 

49951 ACCTCAGGGA CTCCATCCTC CAGCCCCAGC CCCACGACTG CTGGCCCTCT 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4) 



50001 CCTGATACCA TTCACCCTCA ACTTCACCAT CACCAACCTG CAGTATGGGG 

50051 AGGACATGGG TCACCCTGGC TCCAGGAAGT TCAACACCAC AGAGAGGGTC 

50101 CTGCAGGGTC TGCTTGGTCC CATATTCAAG AACACCAGTG TTGGCCCTCT 

50151 GTACTCTGGC TGCAGACTGA CCTCTCTCAG GTCTGAGAAG GATGGAGCAG 

50201 CCACTGGAGT GGATGCCATC TGCATCCATC ATCTTGACCC CAAAAGCCCT 

50251 GGACTCAACA GAGAGCGGCT GTACTGGGAG CTGAGCCAAC TGACCAATGG 

50301 CATCAAAGAG CTGGGCCCCT ACACCCTGGA CAGGAACAGT CTCTATGTCA 

50351 ATGGTTTCAC CCATCGGACC TCTGTGCCCA CCACCAGCAC TCCTGGGACC 

50401 TCCACAGTGG ACCTTGGAAC CTCAGGGACT CCATTCTCCC TCCCAAGCCC 

50451 CGCAACTGCT GGCCCTCTCC TGGTGCTGTT CACCCTCAAC TTCACCATCA 

50501 CCAACCTGAA GTATGAGGAG GACATGCATC GCCCTGGCTC CAGGAAGTTC 

50551 AACACCACTG AGAGGGTCCT GCAGACTCTG CTTGGTCCTA TGTTCAAGAA 

50601 CACCAGTGTT GGCCTTCTGT ACTCTGGCTG CAGACTGACC TTGCTCAGGT 

50651 ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 

50701 cttgacccca aaagccctgg actgnacagn gagcngctnt actgggagct 

50751 nagccanctg accaannnca tcnnngagct gggnccctac accctggaca 

50801 ggnacagtct ctatgtcaat ggtttcaccc atcnganctc tgngcccacc 

50851 accagcactc ctgggacctc cacagtgnac ntnggnacct cngggactcc 

50901 atcctccntc cccngccnca cancnnctgn ccctctcctg ntnccnttca 

50951 ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc 

51001 ccnggntcca ggaagttcaa caccacngag agagtccttc agggtctgct 

51051 caggcctgtg ttcaagaaca ccagtgttgg ccctctgtac tctggctgca 

51101 gactgacctt gctcaggccc aagaaggatg gggcagccac caaagtggat

51151 gccatctgca cctaccgccc tgatcccaaa agccctggac tggacagaga 

51201 gcagctatac tgggagctga gccagctaac ccacagcatc actgagctgg 
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Table 4 (continued) 



Human cDNA of CA125
(SEQ ID NO: 4)



51251 GCCCCTACAC CCAGGACAGG GACAGTCTCT ATGTCAATGG CTTCACCCAT 

51301 CGGAGCTCTG TGCCAACCAC CAGTATTCCT GGGACCTCTG CAGTGCACCT 

51351 GGAAACCACT GGGACTCCAT CCTCCTTCCC CGGCCACACA GAGCCTGGCC 

51401 CTCTCCTGAT ACCATTCACT TTCAACTTTA CCATCACCAA CCTGCGTTAT 

51451 GAGGAAAACA TGCAACACCC TGGTTCCAGG AAGTTCAACA CCACGGAGAG 

51501 GGTTCTGCAG GGTCTGCTCA CGCCCTTGTT CAAGAACACC AGTGTTGGCC 

51551 CTCTGTACTC TGGCTGCAGA CTGACCTTGC TCAGACCTGA GAAGCAGGAG 

51601 GCAGCCACTG GAGTGGACAC CATCTGTACC CACCGCGTTG ATCCCATCGG 

51651 ACCTGGACTG GACAGAGAGC GGCTATACTG GGAGCTGAGC CAGCTGACCA 

51701 ACAGCATCAC AGAGCTGGGA CCCTACACCC TGGATAGGGA CAGTCTCTAT 

51751 GTCGATGGCT TCAACCCTTG GAGCTCTGTG CCAACCACCA GCACTCCTGG 

51801 GACCTCCACA GTGCACCTGG CAACCTCTGG GACTCCATCC CCCCTGCCTG 

51851 GCCACACAGC CCCTGTCCCT CTCTTGATAC CATTCACCCT CAACTTTACC 

51901 ATCACCGACC TGCATTATGA AGAAAACATG CAACACCCTG GTTCCAGGAA 

51951 GTTCAACACC ACGGAGAGGG TTCTGCAGGG TCTGCTCAAG CCCTTGTTCA 

52001 AGAGCACCAG CGTTGGCCCT CTGTACTCTG GCTGCAGACT GACCTTGCTC 

52051 AGACCTGAGA AACATGGGGC AGCCACTGGA GTGGACGCCA TCTGCACCCT

52101 CCGCCTTGAT CCCACTGGTC CTGGACTGGA CAGAGAGCGG CTATACTGGG

52151 AGCTGAGCCA GCTGACCAAC AGCATCACAG AGCTGGGACC CTACACCCTG 

52201 GATAGGGACA GTCTCTATGT CAATGGCTTC AACCCTTGGA GCTCTGTGCC 

52251 AACCACCAGC ACTCCTGGGA CCTCCACAGT GCACCTGGCA ACCTCTGGGA 

52301 CTCCATCCTC CCTGCCTGGC CACACAACTG CTGGCCCTCT CCTGGTGCCG 

52351 TTCACCCTCA ACTTCACCAT CACCAACCTG AAGTACGAGG AGGACATGCA 

52401 TTGCCCTGGC TCCAGGAAGT TCAACACCAC AGAGAGAGTC CTGCAGAGTC 



52451 TGCATGGTCC CATGTTCAAG AACACCAGTG TTGGCCCTCT GTACTCTGGC 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



52501 TGCAGACTGA CCTTGCTCAG GTCCGAGAAG GATGGAGCAG CCACTGGAGT 

52551 GGATGCCATC TGCACCCACC GTCTTGACCC CAAAAGCCCT GGACTGNACA

52601 GNGAGCNGCT NTACTGGGAG CTNAGCCANC TGACCAANNN CATCNNNGAG

52651 CTGGGNCCCT ACACCCTGGA CAGGNACAGT CTCTATGTCA ATGGTTTCAC 

52701 CCATCNGANC TCTGNGCCCA CCACCAGCAC TCCTGGGACC TCCACAGTGN 

52751 ACNTNGGNAC CTCNGGGACT CCATCCTCCN TCCCCNGCCN CACANCNNCT

52801 GNCCCTCTCC TGNTNCCNTT CACCNTCAAC TTNACCATCA CCAACCTGCA 

52851 NTANGNGGAN NACATGCNNC NCCCNGGNTC CAGGAAGTTC AACACCACNG

52901 AGNGNGTNCT GCAGGGTCTG CTNNNNCCCN TNTTCAAGAA CNCCAGTGTN 

52951 GGCCNTCTGT ACTCTGGCTG CAGACTGACC TNNCTCAGGN CNGAGAAGNA

53001 TGGNGCAGCC ACTGGANTGG ATGCCATCTG CANCCACCNN CNTNANCCCA

53051 AAAGNCCTGG ACTGNACAGN GAGCNGCTNT ACTGGGAGCT NAGCCANCTG

53101 ACCAACAGCA TCACAGAGCT GGGACCCTAC ACCCTGGATA GGGACAGTCT 

53151 CTATGTCAAT GGTTTCACCC ATCGAAGCTC TATGCCCACC ACCAGTATTC 

53201 CTGGGACCTC TGCAGTGCAC CTGGAAACCT CTGGGACTCC AGCCTCCCTC 

53251 CCTGGCCACA CAGCCCCTGG CCCTCTCCTG GTGCCATTCA CCCTCAACTT 

53301 CACTATCACC AACCTGCAGT ATGAGGAGGA CATGCGTCAC CCTGGTTCCA

53351 GGAAGTTCAA CACCACGGAG AGAGTCCTGC AGGGTCTGCT CAAGCCCTTG 

53401 TTCAAGAGCA CCAGTGTTGG CCCTCTGTAC TCTGGCTGCA GACTGACCTT 

53451 GCTCAGGCCT GAAAAACGTG GGGCAGCCAC CGGCGTGGAC ACCATCTGCA 

53501 CTCACCGCCT TGACCCTCTA AACCCTGGAC TGNACAGNGA GCNGCTNTAC

53551 TGGGAGCTNA GCCANCTGAC CAANNNCATC NNNGAGCTGG GNCCCTACAC 

53601 CCTGGACAGG NACAGTCTCT ATGTCAATGG TTTCACCCAT CNGANCTCTG 

53651 NGCCCACCAC CAGCACTCCT GGGACCTCCA CAGTGNACNT NGGNACCTCN

53701 GGGACTCCAT CCTCCNTCCC CNGCCNCACA NCNNCTGNCC CTCTCCTGNT
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



53751 NCCNTTCACC NTCAACTTNA CCATCACCAA CCTGCANTAN GNGGANNACA 

53801 TGCNNCNCCC NGGNTCCAGG AAGTTCAACA CCACNGAGNG NGTNCTGCAG 

53851 GGTCTGCTNN NNCCCNTNTT CAAGAACNCC AGTGTNGGCC NTCTGTACTC

53901 TGGCTGCAGA CTGACCTNNC TCAGGNCNGA GAAGNATGGN GCAGCCACTG 

53951 GANTGGATGC CATCTGCANC CACCNNCNTN ANCCCAAAAG NCCTGGACTG

54001 NACAGNGAGC NGCTNTACTG GGAGCTNAGC CANCTGACCA ANNNCATCNN 

54051 NGAGCTGGGN CCCTACACCC TGGACAGGNA CAGTCTCTAT GTCAATGGTT 

54101 TTCACCCTCG GAGCTCTGTG CCAACCACCA GCACTCCTGG GACCTCCACA 

54151 GTGCACCTGG CAACCTCTGG GACTCCATCC TCCCTGCCTG GCCACACAGC 

54201 CCCTGTCCCT CTCTTGATAC CATTCACCCT CAACTTTACC ATCACCAACC 

54251 TGCATTATGA AGAAAACATG CAACACCCTG GTTCCAGGAA GTTCAACACC 

54301 ACGGAGCGGG TCCTGCAGGG TCTGCTTGGT CCCATGTTCA AGAACACAAG

54351 TGTCGGCCTT CTGTACTCTG GCTGCAGACT GACCTTGCTC AGGCCTGAGA 

54401 AGAATGGGGC AGCCACTGGA ATGGATGCCA TCTGCAGCCA CCGTCTTGAC 

54451 CCCAAAAGCC CTGGACTGNA CAGNGAGCNG CTNTACTGGG AGCTNAGCCA

54501 NCTGACCAAN NNCATCNNNG AGCTGGGNCC CTACACCCTG GACAGGNACA 

54551 GTCTCTATGT CAATGGTTTC ACCCATCNGA NCTCTGNGCC CACCACCAGC

54601 ACTCCTGGGA CCTCCACAGT GNACNTNGGN ACCTCNGGGA CTCCATCCTC 

54651 CNTCCCCNGC CNCACANCNN CTGNCCCTCT CCTGNTNCCN TTCACCNTCA 

54701 ACTTNACCAT CACCAACCTG CANTANGNGG ANNACATGCN NCNCCCNGGN 

54751 TCCAGGAAGT TCAACACCAC NGAGNGNGTN CTGCAGGGTC TGCTNNNNCC

54801 CNTNTTCAAG AACNCCAGTG TNGGCCNTCT GTACTCTGGC TGCAGACTGA 

54851 CCTNNCTCAG GNCNGAGAAG NATGGNGCAG CCACTGGANT GGATGCCATC

54901 TGCANCGACC NNCNTNANCC CAAAAGNCCT GGACTGNACA GNGAGCNGCT

54951 NTACTGGGAG CTNAGCCANC TGACCAANNN CATCNNNGAG CTGGGNCCCT 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



55001 ACACCCTGGA CAGGNACAGT CTCTATGTCA ATGGTTTCAC CCATCAGAAC



55051 TCTGTGCCCA CCACCAGTAG TCCTGGGACC TCCACAGTGT ACTGGGCAAC 
55101 CACTGGGACT CCATCCTCCT TCCCCGGCCA CACAGAGCCT GGCCCTCTCC 



55151 TGATACCATT CACTTTCAAC TTTACCATCA CCAACCTGCA TTATGAGGAA

55201 AACATGCAAC ACCCTGGTTC CAGGAAGTTC AACACCACGG AGAGGGTTCT 

55251 GCAGGGTCTG CTCACGCCCT TGTTCAAGAA CACCAGTGTT GGCCCTCTGT 

55301 ACTCTGGCTG CAGACTGACC TTGCTCAGAC CTGAGAAGCA GGAGGCAGCC

55351 ACTGGAGTGG ACACCATCTG TACCCACCGC GTTGATCCCA TCGGACCTGG 

55401 ACTGNACAGN GAGCNGCTNT ACTGGGAGCT NAGCCANCTG ACCAANNNCA

55451 TCNNNGAGCT GGGNCCCTAC ACCCTGGACA GGNACAGTCT CTATGTCAAT 

55501 GGTTTCACCC ATCNGANCTC TGNGCCCACC ACCAGCACTC CTGGGACCTC 

55551 CACAGTGNAC NTNGGNACCT CNGGGACTCC ATCCTCCNTC CCCNGCCNCA 

55601 CANCNNCTGN CCCTCTCCTG NTNCCNTTCA CCNTCAACTT NACCATCACC

55651 AACCTGCANT ANGNGGANNA CATGCNNCNC CCNGGNTCCA GGAAGTTCAA

55701 CACCACNGAG NGNGTNCTGC AGGGTCTGCT NNNNCCCNTN TTCAAGAACN

55751 CCAGTGTNGG CCNTCTGTAC TCTGGCTGCA GACTGACCTN NCTCAGGNCN 

55801 GAGAAGNATG GNGCAGCCAC TGGANTGGAT GCCATCTGCA NCCACCNNCN

55851 TNANCCCAAA AGNCCTGGAC TGNACAGNGA GCNGCTNTAC TGGGAGCTNA

55901 GCCANCTGAC CAANNNCATC NNNGAGCTGG GNCCCTACAC CCTGGACAGG 

55951 NACAGTCTCT ATGTCAATGG TTTCACCCAT CGGAGCTCTG TGCCAACCAC 

56001 CAGCAGTCCT GGGACCTCCA CAGTGCACCT GGCAACCTCT GGGACTCCAT 

56051 CCTCCCTGCC TGGCCACACA GCCCCTGTCC CTCTCTTGAT ACCATTCACC

56101 CTCAACTTTA CCATCACCAA CCTGCATTAT GAAGAAAACA TGCAACACCC 

56151 TGCTTCCAGG AAGTTCAACA CCACGGAGAG GGTTCTGCAG GGTCTGCTCA 

56201 AGCCCTTGTT CAAGAGCACC AGTGTTGGCC CTCTGTACTC TGGCTGCAGA 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



56251 CTGACCTTGC TCAGACCTGA GAAACATGGG GCAGCCACTG GAGTGGACGC

56301 CATCTGCACC CTCCGCCTTG ATCCCACTGG TCCTGGACTG NACAGNGAGC

56351 NGCTNTACTG GGAGCTNAGC CANCTGACCA ANNNCATCNN NGAGCTGGGN

56401 CCCTACACCC TGGACAGGNA CAGTCTCTAT GTCAATGGTT TCACCCATCN

56451 GANCTCTGNG CCCACCACCA GCACTCCTGG GACCTCCACA GTGNACNTNG

56501 GNACCTCNGG GACTCCATCC TCCNTCCCCN GCCNCACANC NNCTGNCCCT

56551 CTCCTGNTNC CNTTCACCNT CAACTTNACC ATCACCAACC TGCANTANGN

56601 GGANNACATG CNNCNCCCNG GNTCCAGGAA GTTCAACACC ACNGAGNGNG

56651 TNCTGCAGGG TCTGCTNNNN CCCNTNTTCA AGAACNCCAG TGTNGGCCNT

56701 CTGTACTCTG GCTGCAGACT GACCTNNCTC AGGNCNGAGA AGNATGGNGC

56751 AGCCACTGGA NTGGATGCCA TCTGCANCCA CCNNCNTNAN CCCAAAAGNC

56801 CTGGACTGNA CAGNGAGCNG CTNTACTGGG AGCTNAGCCA NCTGACCAAN

56851 NNCATCNNNG AGCTGGGNCC CTACACCCTG GACAGGNACA GTCTCTATGT

56901 CAATGGTTTC ACCCATCGGA CCTCTGTGCC CACCACCAGC ACTCCTGGGA

56951 CCTCCACAGT GCACCTGGCA ACCTCTGGGA CTCCATCCTC CCTGCCTGGC

57001 CACACAGCCC CTGTCCCTCT CTTGATACCA TTCACCCTCA ACTTTACCAT

57051 CACCAACCTG CAGTATGAGG AGGACATGCA TCGCCCTGGA TCTAGGAAGT

57101 TCAACACCAC AGAGAGGGTC CTGCAGGGTC TGCTTAGTCC CATTTTCAAG

57151 AACTCCAGTG TTGGCCCTCT GTACTCTGGC TGCAGACTGA CCTCTCTCAG

57201 GCCCGAGAAG GATGGGGCAG CAACTGGAAT GGATGCTGTC TGCCTCTACC

57251 ACCCTAATCC CAAAAGACCT GGGCTGGACA GAGAGCAGCT GTACTGCGAG

57301 CTAAGCCAGC TGACCCACAA CATCACTGAG CTGGGCCCCT ACAGCCTGGA

57351 CAGGGACAGT CTCTATGTCA ATGGTTTCAC CCATCAGAAC TCTGTGCCCA

57401 CCACCAGTAC TCCTGGGACC TCCACAGTGT ACTGGGCAAC CACTGGGACT

57451 CCATCCTCCT TCCCCGGCCA CACANCNNCT GNCCCTCTCC TGNTNCCNTT
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Table 4 (continued) 

Human cDNA of CA125
(SEQ ID NO: 4) 



57501 CACCNTCAAC TTNACCATCA CCAACCTGCA NTANGNGGAN NACATGCNNC 

57551 NCCCNGGNTC CAGGAAGTTC AACACCACNG AGNGNGTNCT GCAGGGTCTG

57601 CTNNNNCCCN TNTTCAAGAA CNCCAGTGTN GGCCNTCTGT ACTCTGGCTG

57651 CAGACTGACC TNNCTCAGGN CNGAGAAGNA TGGNGCAGCC ACTGGANTGG 

57701 ATGCCATCTG CANCCACCNN CNTNANCCCA AAAGNCCTGG ACTGNACAGN

57751 GAGCNGCTNT ACTGGGAGCT NAGCCANCTG ACCAANNNCA TCNNNGAGCT 

57801 GGGNCCCTAC ACCCTGGACA GGNACAGTCT CTATGTCAAT GGTTTCACCC 

57851 ATTGGAGCTC TGGGCTCACC ACCAGCACTC CTTGGACTTC CACAGTTGAC 

57901 CTTGGAACCT CAGGGACTCC ATCCCCCGTC CCCAGCCCCA CAACTGCTGG 

57951 CCCTCTCCTG GTGCCATTCA CCCTAAACTT CACCATCACC AACCTGCAGT 

58001 ATGAGGAGGA CATGCATCGC CCTGGATCTA GGAAGTTCAA CGCCACAGAG 

58051 AGGGTCCTGC AGGGTCTGCT TAGTCCCATA TTCAAGAACA CCAGTGTTGG 

58101 CCCTCTGTAC TCTGGCTGCA GACTGACCTT GCTCAGACCT GAGAAGCAGG 

58151 AGGCAGCCAC TGGAGTGGAC ACCATCTGTA CCCACCGCGT TGATCCCATC 

58201 GGACCTGGAC TGNACAGNGA GCNGCTNTAC TGGGAGCTNA GCCANCTGAC 

58251 CAANNNCATC NNNGAGCTGG GNCCCTACAC CCTGGACAGG NACAGTCTCT 

58301 ATGTCAATGG TTTCACCCAT CNGANCTCTG NGCCCACCAC CAGCACTCCT 

58351 GGGACCTCCA CAGTGNACNT NGGNACCTCN GGGACTCCAT CCTCCNTCCC

58401 CNGCCNCACA NCNNCTGNCC CTCTCCTGNT NCCNTTCACC NTCAACTTNA 

58451 CCATCACCAA CCTGCANTAN GNGGANNACA TGCNNCNCCC NGGNTCCAGG 

58501 AAGTTCAACA CCACNGAGNG NGTNCTGCAG GGTCTGCTNN NNCCCNTNTT

58551 CAAGAACNCC AGTGTNGGCC NTCTGTACTC TGGCTGCAGA CTGACCTNNC 

58601 TCAGGNCNGA GAAGNATGGN GCAGCCACTG GANTGGATGC CATCTGCANC

58651 CACCNNCNTN ANCCCAAAAG NCCTGGACTG NACAGNGAGC NGCTNTACTG 

58701 GGAGCTNAGC CANCTGACCA ANNNCATCNN NGAGCTGGGN CCCTACACCC 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



58751 TGGACAGGNA CAGTCTCTAT GTCAATGGTT TCACCCATCG GAGCTTTGGG

58801 CTCACCACCA GCACTCCTTG GACTTCCACA GTTGACCTTG GAACCTCAGG

58851

58901 CATTCACCCT AAACTTCACC ATCACCAACC TGCAGTATGA GGAGGACATG

58951 CATCGCCCTG GCTCCAGGAA GTTCAACACC ACGGAGAGGG TCCTTCAGGG

59001 TCTGCTTACG CCCTTGTTCA GGAACACCAG TGTCAGCTCT CTGTACTCTG

59051 GTTGCAGACT GACCTTGCTC AGGCCTGAGA AGGATGGGGC AGCCACCAGA

59101 GTGGATGCTG TCTGCACCCA TCGTCCTGAC CCCAAAAGCC CTGGACTGNA

59151 CAGNGAGCNG CTNTACTGGG AGCTNAGCCA NCTGACCAAN NNCATCNNNG

59201 AGCTGGGNCC CTACACCCTG GACAGGNACA GTCTCTATGT CAATGGTTTC

59251 ACCCATCNGA NCTCTGNGCC CACCACCAGC ACTCCTGGGA CCTCCACAGT

59301 GNACNTNGGN ACCTCNGGGA CTCCATCCTC CNTCCCCNGC CNCACANCNN

59351 CTGNCCCTCT CCTGNTNCCN TTCACCNTCA ACTTNACCAT CACCAACCTG

59401 CANTANGNGG ANNACATGCN NCNCCCNGGN TCCAGGAAGT TCAACACCAC

59451 NGAGNGNGTN CTGCAGGGTC TGCTNNNNCC CNTNTTCAAG AACNCCAGTG

59501 TNGGCCNTCT GTACTCTGGC TGCAGACTGA CCTNNCTCAG GNCNGAGAAG

59551 NATGGNGCAG CCACTGGANT GGATGCCATC TGCANCCACC NNCNTNANCC

59601 CAAAAGNCCT GGACTGNACA GNGAGCNGCT NTACTGGGAG CTNAGCCANC

59651 TGACCAANNN CATCNNNGAG CTGGGNCCCT ACACCCTGGA CAGGNACAGT

59701 CTCTATGTCA ATGGTTTCAC CCATTGGATC CCTGTGCCCA CCAGCAGCAC

59751 TCCTGGGACC TCCACAGTGG ACCTTGGGTC AGGGACTCCA TCCTCCCTCC

59801 CCAGCCCCAC AACTGCTGGC CCTCTCCTGG TACCATTCAC CCTCAACTTC

59851 ACCATCACCA ACCTGCAGTA TGGGGAGGAC ATGGGTCACC CTGGCTCCAG

59901 GAAGTTCAAC ACCACAGAGA GGGTCCTGCA GGGTCTGCTT GGTCCCATAT

59951 TCAAGAACAC CAGTGTTGGC CCTCTGTACT CTGGCTGCAG ACTGACCTCT
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Table 4 (continued)



Human cDNA of CA125
(SEQ ID NO: 4)



60001 CTCAGGTCCG AGAAGGATGG AGCAGCCACT GGAGTGGATG CCATCTGCAT

60051

60101 GGGAGCTNAG CCANCTGACC AANNNCATCN NNGAGCTGGG NCCCTACACC

60151 CTGGACAGGN ACAGTCTCTA TGTCAATGGT TTCACCCATC NGANCTCTGN

60201 GCCCACCACC AGCACTCCTG GGACCTCCAC AGTGNACNTN GGNACCTCNG

60251 GGACTCCATC CTCCNTCCCC NGCCNCACAN CNNCTGNCCC TCTCCTGNTN

60301 CCNTTCACCN TCAACTTNAC CATCACCAAC CTGCANTANG NGGANNACAT

60351 GCNNCNCCCN GGNTCCAGGA AGTTCAACAC CACNGAGNGN GTNCTGCAGG

60401 GTCTGCTNNN NCCCNTNTTC AAGAACNCCA GTGTNGGCCN TCTGTACTCT

60451 GGCTGCAGAC TGACCTNNCT CAGGNCNGAG AAGNATGGNG CAGCCACTGG

60501 ANTGGATGCC ATCTGCANCC ACCNNCNTNA NCCCAAAAGN CCTGGACTGN

60551 ACAGNGAGCN GCTNTACTGG GAGCTNAGCC ANCTGACCAA NNNCATCNNN

60601 GAGCTGGGNC CCTACACCCT GGACAGGNAC AGTCTCTATG TCAATGGTTT

60651 CACCCATCAG ACCTTTGCGC CCAACACCAG CACTCCTGGG ACCTCCACAG

60701 TGGACCTTGG GACCTCAGGG ACTCCATCCT CCCTCCCCAG CCCTACATCT

60751 GCTGGCCCTC TCCTGGTGCC ATTCACCCTC AACTTCACCA TCACCAACCT

60801 GCAGTACGAG GAGGACATGC ATCACCCAGG CTCCAGGAAG TTCAACACCA

60851 CGGAGCGGGT CCTGCAGGGT CTGCTTGGTC CCATGTTCAA GAACACCAGT

60901 GTCGGCCTTC TGTACTCTGG CTGCAGACTG ACCTTGCTCA GGCCTGAGAA

60951 GAATGGGGCA GCCACCAGAG TGGATGCTGT CTGCACCCAT CGTCCTGACC

61001 CCAAAAGCCC TGGACTGNAC AGNGAGCNGC TNTACTGGGA GCTNAGCCAN

61051 CTGACCAANN NCATCNNNGA GCTGGGNCCC TACACCCTGG ACAGGNACAG

61101 TCTCTATGTC AATGGTTTCA CCCATCNGAN CTCTGNGCCC ACCACCAGCA

61151 CTCCTGGGAC CTCCACAGTG NACNTNGGNA CCTCNGGGAC TCCATCCTCC

61201 NTCCCCNGCC NCACAGCCCC TGTCCCTCTC TTGATACCAT TCAGCCTCAA
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



61251 CTTTACCATC ACCAACCTGC ATTATGAAGA AAACATGCAA CACCCTGGTT

61301 CCAGGAAGTT CAACACCACG GAGAGGGTTC TGCAGGGTCT GCTCAAGCCC

61351 TTGTTCAAGA GCACCAGCGT TGGCCCTCTG TACTCTGGCT GCAGACTGAC 

61401 CTTGCTCAGA CCTGAGAAAC ATGGGGCAGC CACTGGAGTG GACGCCATCT

61451 GCACCCTCCG CCTTGATCCC ACTGGTCCTG GACTGGACAG AGAGCGGCTA 

61501 TACTGGGAGC TGAGCCAGCT GACCAACAGC GTTACAGAGC TGGGCCCCTA 

61551 CACCCTGGAC AGGGACAGTC TCTATGTCAA TGGCTTCACC CAGCGGAGCT 

61601 CTGTGCCAAC CACCAGTATT CCTGGGACCT CTGCAGTGCA CCTGGAAACC 

61651 TCTGGGACTC CAGCCTCCCT CCCTGGCCAC ACAGCCCCTG GCCCTCTCCT 

61701 GGTGCCATTC ACCCTCAACT TCACTATCAC CAACCTGCAG TATGAGGTGG 

61751 ACATGCGTCA CCCTGGTTCC AGGAAGTTCA ACACCACGGA GAGAGTCCTG 

61801 CAGGGTCTGC TCAAGCCCTT GTTCAAGAGC ACCAGTGTTG GCCCTCTGTA 

61851 CTCTGGCTGC AGACTGACCT TGCTCAGGCC TGAAAAACGT GGGGCAGCCA

61901 CCGGCGTGGA CACCATCTGC ACTCACCGCC TTGACCCTCT AAACCCTGGA 

61951 CTGGACAGAG AGCAGCTATA CTGGGAGCTG AGCAAACTGA CCCGTGGCAT 

62001 CATCGAGCTG GGCCCCTACC TCCTGGACAG AGGCAGTCTC TATGTCAATG 

62051 GTTTCACCCA TCGGAACTTT GTGCCCATCA CCAGCACTCC TGGGACCTCC 

62101 ACAGTACACC TAGGAACCTC TGAAACTCCA TCCTCCOTAC CTA 

62151 AGTGCCTGGC CCTCTCCTGG TGCCATTCAC CCTCAACTTC ACCATCACCA 

62201 ACTTGCAGTA TGAGGAGGCC ATGCGACACC CTGGCTCCAG GAAGTTCAAT

62251 ACCACGGAGA GGGTCCTACA GGGTCTGCTC AGGCCCTTGT TCAAGAATAC

62301 CAGTATCGGC CCTCTGTACT CCAGCTGCAG ACTGACCTTG CTCAGGCCAG 

62351 AGAAGGACAA GGCAGCCACC AGAGTGGATG CCATCTGTAC CCACCACCCT

62401 GACCCTCAAA GCCCTGGACT GAACAGAGAG CAGCTGTACT GGGAGCTGAG

62451 CCAGCTGACC CACGGCATCA CTGAGCTGGG CCCCTACACC CTGGACAGGG 
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Table 4 (continued) 



Human cDNA of CA125 
(SEQ ID NO: 4)



62501 ACAGTCTCTA TGTCGATGGT TTCACTCATT GGAGCCCCAT ACCGACCACC 

62551 AGCACTCCTG GGACCTCCAT AGTGAACCTG GGAACCTCTG GGATCCCACC 

62601 TTCCCTCCCT GAAACTACAN CNNCTGNCCC TCTCCTGNTN CCNTTCACCN

62651 TCAACTTNAC CATCACCAAC CTGCANTANG NGGANNACAT GCNNCNCCCN 

62701 GGNTCCAGGA AGTTCAACAC CACNGAGAGG GTTCTGCAGG GTCTGCTCAA 

62751 GCCCTTGTTC AAGAGCACCA GTGTTGGCCC TCTCTATTCT GGCTGCAGAC

62801 TGACCTTGCT CAGGCCTGAG AAGGACGGAG TAGCCACCAG AGTGGACGCC

62851 ATCTGCACCC ACCGCCCTGA CCCCAAAATC CCTGGGCTAG ACAGACAGCA 

62901 GCTATACTGG GAGCTGAGCC AGCTGACCCA CAGCATCACT GAGCTGGGAC 

62951 CCTACACCCT GGATAGGGAC AGTCTCTATG TCAATGGTTT CACCCAGCGG 

63001 AGCTCTGTGC CCACCACCAG CACTCCTGGG ACTTTCACAG TACAGCCGGA 

63051 AACCTCTGAG ACTCCATCAT CCCTCCCTGG CCCCACAGCC ACTGGCCCTG

63101 TCCTGCTGCC ATTCACCCTC AATTTTACCA TCACTAACCT GCAGTATGAG

63151 GAGGACATGC ATCGCCCTGG CTCCAGGAAG TTCAACACCA CGGAGAGGGT

63201 CCTTCAGGGT CTGCTTATGC CCTTGTTCAA GAACACCAGT GTCAGCTCTC 

63251 TGTACTCTGG TTGCAGACTG ACCTTGCTCA GGCCTGAGAA GGATGGGGCA 

63301 GCCACC3VGAG TGGATGCTGT CTGCACCCaT CGTCCTGACC CCAAAAGCCC 

63351 TGGACTGGAC AGAGAGCGGC TGTACTGGAA GCTGAGCCAG CTGACCCACG

63401 GCATCACTGA GCTGGGCCCC TACACCCTGG ACAGGCACAG TCTCTATGTC

63451 AATGGTTTCA CCCATCAGAG CTCTATGACG ACCACCAGAA CTCCTGATAC 

63501 CTCCACAATG CACCTGGCAA CCTCGAGAAC TCCAGCCTCC CTGTCTGGAC 

63551 CTACXSACCGC CAGCCCTCTC CTGGTGCTAT TCACAATTAA CTTCACCATC 

63601 ACTAACCTGC GGTATGAGGA GAACATGCAT CACCCTGGCT CTAGAAAGTT

63651 TAACACCACG GAGAGAGTCC TTCAGGGTCT GCTCAGGCCT GTGTTCAAGA

63701 ACACCAGTGT TGGCCCTCTG TACTCTGGCT GCAGACTGAC CTTGCTCAGG 
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



63751 CCCAAGAAGG ATGGGGCAGC CACCAAAGTG GATGCCATCT GCACCTACCG

63801 CCCTGATCCC AAAAGCCCTG GACTGGACAG AQAGCAGCTA TACTGGGAGC

63851 TGAGCCAGCT AACCCACAGC ATCACTGAGC TGGGCCCCTA CACCCTGGAC

63901 AGGGACAGTC TCTATGTCAA TGGTTTCACA CAGCGGAGCT CTGTGCCCAC

63951 CACTAGCATT CCTGGGACCC CCACAGTGGA CCTGGGAACA TCTGGGACTC

64001 CAGTTTCTAA ACCTGCTCCC TCGGCTGCCA GCCCTCTCCT GGTGCTATTC

64051 ACTCTCAACT TCACCATCAC CAACCTGCGG TATGAGGAGA ACATGCAGCA

64101 CCCTGGCTCC AGGAAGTTCA ACACCACGGA GAGGGTCCTT CAGGGCCTGC

64151 TCAGGTCCCT GTTCAAGAGC ACCAGTGTTG GCCCTCTGTA CTCTGGCTGC

64201 AGACTGACTT TGCTCAGGCC TGAAAAGGAT GGGACAGCCA CTGGAGTGGA

64251 TGCCATCTGC ACCCACCACC CTGACCCCAA AAGCCCTAGG CTGGACAGAG

64301 AGCAGCTGTA TTGGGAGCTG AGCCAGCTGA CCCACAATAT CACTGAGCTG

64351 GGCCACTATG CCCTGGACAA CXSACAGCCTC TTTGTCAATG GTTTCACTCA

64401 TCGGAGCTCT GTGTCCACCA CCAGCACTCC TGGGACCCCC ACAGTGTATC

64451 TGGGAGCATC TAAGACTCCA GCCTCGATAT TTGGCCCTTC AGCTGCCAGC

64501 CATCTCCTGA TACTATTCAC CCTCAACTTC ACCATCACTA ACCTGCGGTA

64551 TGAGGAGAAC ATGTGGCCTG GCTCCAGGAA GTTCAACACT ACAGAGAGGG

64601 TCCTTCAGGG CCTGCTAAGG CCCTTGTTCA AGAACACCAG TGTTGGCCCT

64651 CTGTACTCTG GCTCCAGGCT GACCTTGCTC AGGCCAGAGA AAGATGGGGA

64701 AGCCACCGGA GTGGATGCCA TCTGCACCCA CCGCCCTGAC CCCACAGGCC

64751 CTGGGCTGGA CAGAGAGCAG CTGTATTTGG AGCTGAGCCA GCTGACCCAC

64801 AGCATCACTG AGCTGGGCCC CTACACACTG GACAGGGACA GTCTCTATGT

64851 CAATGGTTTC ACCCATCGGA GCTCTGTACC CACCACCAGC ACCGGGGTGG

64901 TCAGCGAGGA GCCATTCACA CTGAACTTCA CCATCAACAA CCTGCGCTAC

64951 ATGGCX3GACA TGGGCCAACC CGGCTCCCTC AAGTTCAACA TCACAGACAA
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



65001 CGTCATGAAG CACCTGCTCA GTCCTTTGTT CCAGAGGAGC AGCCTGGGTG 

65051 CACGGTACAC AGGCTGCAGG GTCATCGCAC TAAGGTCTGT GAAGAACGGT 

65101 GCTGAGACAC GGGTGGACCT CCTCTGCACC TACCTGCAGC CCCTCAGCGG 

65151 CCCAGGTCTG CCTATCAAGC AGGTGTTCCA TGAGCTGAGC CAGCAGACCC 



65201 ATGGCATCAC CCGGCTGGGC CCCTACTCTC TGGACAAAGA CAGCCTCTAC

65251 CTTAACGGTT ACAATGAACC TGGTCTAGAT GAGCCTCCTA CAACTCCCAA

65301 GCCAGCCACC ACATTCCTGC CTCCTCTGTC AGAAGCCACA ACAGCCATGG

65351 GGTACCACCT GAAGACCCTC ACACTCAACT TCACCATCTC CAATCTCCAG

65401 TATTCACCAG ATATGGGCAA GGGCTCAGCT ACATTCAACT CCACCGAGGG

65451 GGTCCTTCAG CACCTGCTCA GACCCTTGTT CCAGAAGAGC AGCATGGGCC

65501 CCTTCTACTT GGGTTGCCAA CTGATCTCCC TCAGGCCTGA GAAGGATGGG

65551 GCAGCCACTG GTGTGGACAC CACCTGCACC TACCACCCTG ACCCTGTGGG

65601 CCCCGGGCTG GACATACAGC AGCTTTACTG GGAGCTGAGT CAGCTGACCC

65651 ATGGTGTCAC CCAACTGGGC TTCTATGTCC TGGACAGGGA TAGCCTCTTC

65701 ATCAATGGCT ATGCACCCCA GAATTTATCA ATCCGGGGCG AGTACCAGAT

65751 AAATTTCCAC ATTGTCAACT GGAACCTCAG TAATCCAGAC CCCACATCCT

65801 CAGAGTACAT CACCCTGCTG AGGGACATCC AGGACAAGGT CACCACACTC

65851 TACAAAGGCA GTCAACTACA TGACACATTC CGCTTCTGCC TGGTCACCAA

65901 CTTGACGATG GACTCOGTGT TGGTCACTGT CAAGGCATTG TTCTCCTCCA

65951 ATTTGGACCC CAGCCTGGTG GAGCAAGTCT TTCTAGATAA GACCCTGAAT

66001 GCCTCATTCC ATTGGCTGGG CTCCACCTAC CAGTTGGTGG ACATCCATGT

66051 GACAGAAATG GAGTCATCAG TTTATCAACC AACAA6CAGC TCCAGCACCC

66101 AGCACTTCTA CCTGAATTTC ACCATCACCA ACCTACCATA TTCCCAGGAC

66151 AAAGCCCAGC CAGGCACCAC CAATTACCAG AGGAACAAAA GGAATATTGA

66201 GGATGCGCTC AACCAACTCT TCCGAAACAG CAGCATCAAG AGTTATTTTT
Table 4 (continued)

Human cDNA of CA125
(SEQ ID NO: 4)



66251 CTGACTGTCA AGTTTCAACA TTCAGGTCTG TCCCCAACAG GCACCACACC

66301 GGGGTGGACT CCCTGTGTAA CTTCTCGCCA CTGGCTCGGA GAGTAGACAG

66351 AGTTGCCATC TATGAGGAAT TTCTGCGGAT GACCCGGAAT GGTACCCAGC

66401 TGCAGAACTT CACCCTGGAC AGGAGCAGTG TCCTTGTGGA TGGGTATTCT

66451 CCCAAC2W3AA ATGAGCCCTT AACTGGGAAT TCTGACCTTC CCTTCTGGGC

66501 TGTCATCCTC ATCGGCTTGG CAGGACTCCT GGGACTCATC ACATGCCTGA

66551 TCTGCGGTGT CCTGGTGACC ACCCGCCGGC GGAAGAAGGA AGGAGAATAC

66601 AACGTCCAGC AACAGTGCCC AGGCTACTAC CAGTCACACC TAGACCTGGA

66651 GGATCTGCAA TGACTGGAAC TTGCCGGTGC CTGGGGTGCC TTTCCCCCAG

66701 CCAGGGTCCA AAGAAGCTTG GCTGGGGCAG AAATAAACCA TATTGGTCGG

66751 AAAAAAAAAA AAAAA
Table 5

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



1 MLKPSGLPGS SSPTRSLMTG SRSTKATPEM DSGLTGATLS PKTSTGAIW
51 TEHTLPFTSP DKTLASPTSS WGRTTQSLG VMSSALPEST SRGMTHSEQR
101 TSPSLSPQVN GTPSRNYPAT SMVSGLSSPR TRTSSTEGNF TKEASTYTIiT
151 VETTSGPVTE KYTVPTETST TEGDSTETPW DTRYIPVKIT SPMKTFADST
201 ASKENAPVSM TPAETTVTDS HTPGRTNPSF GTLYSSFLDL SPKGTPNSRG
251 ETSLELILST TGYPFSSPEP GSAGHSRIST SAPLSSSASV LDNKISETSI
301 FSGQSLTSPL SPGVPEARAS TMPNSAIPFS MTLSNAETSA ERVRSTISSL
351 GTPSISTKQT AETILTFHAF AETMDIPSTH lAKTLASEWL GSPGTLGGTS
401 TSALTTTSPS TTLVSEETNT HHSTSGKETK GTLNTSMTPL ETSAPGEESE
451 MTATLVPTLG FTTLDSKIRS PSQVSSSHPT RELRTTGSTS GRQSSSTAAH
501 GSSDILRATT SSTSKASSWT SESTAQQFSE PQHTQWVETS PSMKTERPPA
551 STSVAAPITT SVPSWSGFT TLKTSSTKGI WLEETSADTL IGESTAGPTT
601 HQFAVPTGIS MTGGSSTRGS QGTTHLLTRA TASSBTSADL TLATNGVPVS
651 VSPAVSKTAA GSSPPGGTKP SYTMVSSVIP BTSSLQSSAF REGTSLGLTP
701 IiNTRHPFSSP EPDSAGHTKI STSIPLLSSA SVLEDKVSAT STFSHHKATS
751 SITTGTPEIS TKTKPSSAVL SSMTLSNAAT SPERVRNATS PLTHPSPSGE
801 ETAGSVLTLS TSAETTDSPN IHPTGTLTSE SSESPSTLSL PSVSGVKTTF
851 SSSTPSTHLF TSGEBTEBTS NPSVSQPETS VSRVRTTLAS TSVPTPVFPT
901 MDTWPTRSAQ FSSSHLVSEL RATSSTSVTN STGSALPKIS HLTGTATMSQ
951 TNRDTFNDSA APQSTTWPET SPRFKTGLPS ATTTVSTSAT SLSATVMVSK
1001 FTSPATSSME ATSIRBPSTT ILTTBTTNGP GSMAVASTNI PIGKGYITEG
1051 RLDTSHLPIG TTASSETSMD FTMAKESVSM SVSPSQSMDA AGSSTPGRTS
1101 QFVDTFSDDV YHLTSREITI PRDGTSSALT PQMTATHPPS PDPGSARSTW
1151 LGILSSSPSS PTPKVTMSST FSTQRVTTSM IMDTVETSRW NMPNLPSTTS
1201 LTPSNIPTSG AIGKSTLVPL DTPSPATSLE ASEGGLPTLS TYPESTNTPS
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



1251 IHLGAHASSB SPSTINLTMA SWKPGSYTP LTFPSIETHI HVSTARMAYS
1301 SGSSPEMTAP GBTNTGSTWD PTTYITTTDP KDTSSAQVST PHSVRTLRTT
1351 ENHPKTESAT PAAYSGSPKI SSSPNLTSPA TKAWTITDTT EHSTQLHYTK
1401 LAEKSSGFET QSAPGPVSW IPTSPTIGSS TLELTSDVPG EPLVLAPSEQ
1451 TTITLPMATW LSTSLTEEMA STDLDISSPS SPMSTFAIFP PMSTPSHELS
1501 KSEADTSAIR NTDSTTLDQH LGIRSLGRTG DLTTVPITPL TTTWTSVIEH
1551 STQAQDTLSA TMSPTHVTQS LKDQTSIPAS ASPSHLTEVY PBLGTQGRSS
1601 SEATTFWKPS TDTLSREIET GPTNIQSTPP MDNTTTGSSS SGVTLGIAHL
1651 PIGTSSPAET STNMALERRS STATVSMAGT MGLLVTSAPG RSISQSLGRV
1701 SSVLSESTTE GVTDSSKGSS PRLNTQGNTA LSSSLEPSYA EGSQMSTSIP
1751 LTSSPTTPDV EFIGGSTPWT KEVTTVMTSD ISKSSARTES SSATLMSTAL
1801 GSTENTGKEK LRTASMDLPS PTPSMBVTPW ISLTLSNAPN TTDSLDLSHG
1851 VHTSSAGTLA TDRSLNTGVT RASRLENGSD TSSKSLSMGN STHTSMTDTB
1901 KSEVSSSIHP RPETSAPGAB TTLTSTPGNR AISLTLPFSS IPVEEVISTG
1951 ITSGPDINSA PMTHSPITPP TIVWTSTGTI BQSTQPLHAV SSBKVSVQTQ
2001 STPYVNSVAV SASPTHENSV SSGSSTSSPY SSASLESLDS TISRRNAITS
2051 WLWDLTTSLP TTTWPSTSLS EALSSGHSGV SNPSSTTTEF PLFSAASTSA
2101 AKQENPETET H6PQNTAAST LNTDASSVTG LSETPVGASI SSEVPLPMAI
2151 TSRSDVSGLT SESTANPSL6 TASSAGTKLT RTISLPTSES LVSFRMNKDP
2201 WTVSIPLGSH PTTNTETSIP VNSAGPPGLS TVASDVIDTP SDGAESIPTV
2251 SFSPSPDTEV TTISHFPEKT THSFRTISSL THBLTSRVTP IPGDWMSSAM
2301 STKPTGASPS ITLGERRTIT SAAPTTSPIV LTASFTBTST VSLDNETTVK
2351 TSDILDARKT NELPSDSSSS SDLINTSIAS STMDVTKTAS ISPTSISGMT
2401 ASSSPSLFSS DRPQVPTSTT ETNTATSPSV SSNTYSLDGG SNVGGTPSTL
2451 PPFTITHPVE TSSALLAWSR PVRTFSTMVS TDTASGENPT SSNSWTSVP
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



2501 APGTWASVGS TTDLPAMGFL KTSPAGEAHS LLASTIEPAT AFTPHLSAAV
2551 VTGSSATSEA SLLTTSESKA IHSSPQTPTT PTSGANWETS ATPESLLWT
2601 ETSDTTLTSK ILVTDTILFS TVSTPPSKFP STGTLSGASF PTLLPDTPAI
2651 PLTATEPTSS LATSFDSTPL VtlASDSLGT VPETTLTMSE TSNGDALVLK
2701 TVSNPDRSIP GITIQGVTES PLHPSSTSPS KIVAPRNTTY EGSITVALST
2751 LPAGTTGSLV FSQSSENSET TALVDSSAGL ERASVMPLTT GSQGMASSGG
2801 IRSGSTHSTG TKTFSSLPLT MNPGEVTAMS EITTNRLTAT QSTAPKGIPV
2851 KPTSAESGLL TPVSASSSPS KAFASLTTAP PSTWGIPQST LTFEFSEVPS
2901 LDTKSASLPT PGQSLNTIPD SDASTASSSL SKSPEKNPRA RMMTSTKAIS
2951 ASSFQSTGFT ETPEGSASPS MAGHEPRVPT SGTGDPRYAS ESMSYPDPSK
3001 ASSAMTSTSL ASKLTTLFST GQAARSGSSS SPISLSTEKE TSFLSPTAST
3051 SRKTSLFLGP SMARQPNILV HLQTSALTLS PTSTLNMSQB EPPELTSSQT
3101 lAEEEGTTAE TQTLTFTPSE TPTSLLPVSS PTEPTARRKS SPETWASSIS
3151 VPAKTSLVET TDGTLVTTIK MSSQAAQGNS TWPAPAEETG TSPAGTSPGS
3201 PEVSTTLKIM SSKEPSISPB IRSTVRNSP^ KTPETWPME TTVEPVTLQS
3251 TALGSGSTSI SHLPTGTTSP TKSPTENMLA TERVSLSPSP PEAWTNLYSG
3301 TPGGTRQSLA TMSSVSLBSP TARSITGTGQ QSSPELVSKT TGMEFSMWHG
3351 STGGTTGDTH VSLSTSSNIL EDPVTSPNSV SSLTDKSKHK TETWVSTTAI
3401 PSTVLNNKIM AAEQQTSRSV DEAYSSTSSW SDQTSGSDIT LGASPDVTNT
3451 LYITSTAQTT SLVSLPSGDQ GITSLTNPSG GKTSSASSVT SPSIGLETLR
3501 ANVSAVKSDI APTAGHLSQT SSPABVSILD VTTAPTPGIS TTITTMGTNS
3551 ISTTTPNPEV GMSTMDSTPA TERRTTSTEH PSTWSSTAAS DSWTVTDMTS
3601 NLKVARSPGT ISTMHTTSFL ASSTELDSMS TPHGRITVIG TSLVTPSSDA
3651 SAVKTETSTS ERTLSPSDTT ASTPISTFSR VQRMSISVPD ILSTSWTPSS
3701 TEAEDVPVSM VPTDHASTKT DPNTPLSTFL FDSLSTLDWD TGRSLSSATA
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



3751 TTSAPQGATT PQELTLETMI SPATSQLPFS IGHITSAVTP AAMARSSGVT
3801 FSRPDPTSKK AEQTSTQLPT TTSAHPGQVP RSAATTLDVI PHTAKTPDAT
3851 FQRQGQTALT TEARATSDSW NEKBKSTPSA PWITEMMNSV SEDTIKEVTS
3901 SSSVLKDPEY AGHKLGIWDD FIPKFGKAAH MRELPLLSPP QDKEAIHPST
3951 NTVETTGWVT SSEHASHSTI PAHSASSKLT SPWTTSTRB QAIVSMSTTT
4001 WPESTRRRTE PNSFLTIELR DVSPYMDTSS TTQTSIISSP GSTAITKGHR
4051 TEITSYKRIS SSFLAQSMRS SDSPSEAITR LSNFPAMTES GGMILM4QTS
4101 PPGATSISAP TLDTSATASW TGTPLATTQR PTYSBKTTLF SKGREDTSQP
4151 SPPCVEETSS SSSWPIHAT TSPSNILLTS QGHSPSSTPP VTSVFLSETS
4201 GLGKTTDMSR ISLEPGTSLP PNLSSTAGBA LSTYEASRDT KAIHHSADTA
4251 VTNMEATSSE YSPIPGHTKP SKATSPLVTS HIMGDITSST SVFGSSBTTE
4301 lETVSSVNQG LQBRSTSQVA SSATETSTVI THVSSGDATT HVTKTQATFS
4351 SGTSISSPHQ FITSTNTFTD VSTNPSTSLI MTESSGVTIT TQTGPTGAAT
4401 QGPYLLDTST MPYLTETPLA VTPDFMQSEK TTLISKGPKD VTWTSPPSVA
4451 ETSYPSSLTP FLVTTIPPAT STLQGQHTSS PVSATSVLTS GLVKTTDMLN
4501 TSMEPVTNSP QNLNNPSNEI LATLAATTDI ETIHPSINKA VTNMGTASSA
4551 HVLHSTLPVS SEPSTATSPM VPASSMGDAL AS1SIP6SET TDIEGBPTSS
4601 LTAGRKENST LQEMNSTTES NIILSNVSVG AITEATKMEV PSFDATPIPT
4651 PAQSTKFPDI FSVASSRLSN SPPMTISTHM TTTQTGSSGA TSKIPLALDT
4701 STLETSAGTP SWTEGFAHS KITTAMNNDV KDVSQTNPPP QDEASSPSSQ
4751 APVLVTTLPS SVAFTPC2WHS TSSPVSMSSV LTSSLVKTAfi KVDTSLETVT
4801 SSPQSMSNTL DDISVTSAAT TDIETTHPSI NTWTNVGTT GSAFESHSTV
4851 SAYPEPSKVT SPNVTTSTME DTTISRSIPK SSKTTRTETE TTSSLTPKLR
4901 BTSISQEITS STBTSTVPYK BLTGATTEVS RTDVTSSSST SFPGPDQSTV
4951 SLDISTETNT RLSTSPIMTE SAEITITTQT GPHGATSQDT FTMDPSNTTP
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



5001 QAGIHSAMTH GFSQLDVTTL MSRIPQDVSW TSPPSVDKTS SPSSFLSSPA 

5051 MTTPSLISST LPEDKLSSPM TSLLTSGLVK ITDILRTRLE PVTSSLPNFS 

5101 STSDKILATS KDSKDTKEIF PSINTEETNV KANNSGHESH SPALADSETP 

5151 KATTQMVITT TVGDPAPSTS MPVHGSSETT NIKREPTYFL TPRLRETSTS 

5201 QESSFPTDTS FLLSKVPTGT ITEVSSTGVI SSSKISTPDH DKSTVPPDTF 

5251 TGEIPRVFTS SIKTKSAEMT ITTQASPPES ASHSTLPLDT STTLSQGGTH 

5301 STVSQGFPYS EVTTLMGMGP GNVSWMTTPP VEETSSVSSL MSSPAMTSPS 

5351 PVSSTSPQSI PSSPLPVTAL PTSVLVTTTD VLGTTSPESV TSSPPNLSSI 

5401 THERPATYKD TAHTEAAMHH STNTAVTNVG TSGSGHKSQS SVLADSETSK 

5451 ATPLMSTAST LGDTSVSTST PNISQTNQIQ TEPTASLSPR LRESSTSEKT 

5501 SSTTETNTAF SYVPTGAITQ ASRTBISSSR TSISDLDRST lAPDISTGMI 

5551 TRLFTSPIMT KSAEMTVTTQ TTTPGATSQG ILPWDTSTTL FQGGTHSTVS 

5601 QGFPHSEITT LRSRTPGDVS WMTTPPVEET SSGFSLMSPS MTSPSPVSST 

5651 SPESIPSSPL PVTALLTSVL VTTTNVLGTT SPEPVTSSPP NLSSPTQERL 

5701 TTYKDTAHTE AMHASMHTNT AVANVGTSIS GHESQSSVPA DSHTSKATSP 

5751 MGITFAMGDT SVYTSTPAFF ETRIQSESTS SLIPGLRDTR TSEEINTVTE 

5801 TSTVLSEVPT TTTTEVSRTE VITSSRTTIS GPDHSKMSPY ISTETITRLS 

5851 TFPFVTGSTB MAITNQTGPI GTISQATLTL DTSSTASWEG THSPVTQRFP 

5901 HSEETTTMSR STKGVSWQSP PSVEETSSPS SPVPLPAITS HSSLYSAVSG 

5951 SSPTSALPVT SLLTSGRRKT IDMLDTHSBL VTSSLPSASS FSGEILTSEA 

6001 STNTETIHFS ENTAETNMGT TNSMHKLHSS VSIHSQPSCffl TPPKVT6SMM 

6051 EDAIVSTSTP GSPETKNVDR DSTSPLTPEL KEDSTALVMN STTESNTVFS 

6101 SVSLDAATEV SRAEVTYYDP TFMPASAQST KSPDISPEAS SSHSNSPPLT 

6151 ISTHKTIATQ TGPSGVTSLG QLTLDTSTIA TSAGTPSART QDFVDSETTS 

6201 VMNNDLNDVL KTSPFSAEEA NSLSSQAPLL VTTSPSPVTS TLQEHSTSSL 
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



6251 VSVTSVPTPT LAKITDMDTN LEPVTRSPQN LRNTLATSEA TTDTHTMHPS
6301 INTAMANVGT TSSPNEFYFT VSPDSDPYKA TSAWITSTS GDSIVSTSMP
6351 RSSAMKKIES ETTFSLIFRL RETSTSQKIG SSSDTSTVFD KAFTAATTEV
6401 SRTELTSSSR TSIQGTBKPT MSPDTSTRSV TMLSTFAGLT KSEERTIATQ
6451 TGPHRATSQ6 TLTWDTSITT SQAGTHSAMT HGFSQLDLST LTSRVPBYIS
6501 GTSPPSVEKT SSSSSLLSLP AITSPSPVPT TLPESRPSSP VHLTSLPTSG
6551 LVKTTDMLAS VASLPPNLGS TSHKIPTTSE DIKDTEKMYP STNIAVTNVG
6601 TTTSEKESYS SVPAYSEPPK VTSPMVTSFN IRDTIVSTSM PGSSEITRIE
6651 MESTFSLAHG LKGTSTSQDP IVSTEKSAVL HKLTTGATET SRTEVASSRR
6701 TSIPGPDHST ESPDISTEVI PSLPISLGIT ESSNMTIITR TGPPLGSTSQ
6751 GTFTLDTPTT SSRAGTHSMA TQEFPHSBMT TVMNKDPEIL SWTIPPSIEK
6801 TSFSSSLMPS PAMTSPPVSS TLPKTIHTTP SPMTSLLTPS LVMTTDTLGT
6851 SPEPTTSSPP NLSSTSHVIIi TTDEDTTAIE AMHPSTSTAA TNVETTCSGH
6901 GSQSSVLTDS EKTKATAPMD TTSTMGHTTV STSMSVSSET TKIKRESTYS
6951 LTPGLRBTSI SQNASPSTDT SIVLSEVPTG TTAEVSRTEV TSSGRTSIPG
7001 PSQSTVLPEI STRTMTRLFA SPTMTESJ^M TIPTQTGPSG STSQDTLTLD
7051 TSTTKSQAKT HSTLTQRFPH SBMTTLNSRG PGDMSWQSSP SLENPSSLPS
7101 LLSLPATTSP PPISSTLPVT ISSSPLPVTS DMLHTSPELV
7151 TSSPPKLSHT SDERLTTGKD TTKPTEAVHPS TNTAASNVEI PSFGHESPSS
7201 ALADSBTSKA TSPMFITSTQ EDTTVAISTP HFLETSRIQK ESISSLSPKL
7251 RBTGSSVBTS SAIETSAVLS EVSIGATTBl SRTEVTSSSR TSISGSAEST
7301 MLPEISTTRK IIKFPTSPIL AESSEMTIKT QTSPPGSTSE STFTLDTSTT
7351 PSLVITHSTM TQRLPHSEIT TLVSRGAGDV PRPSSLPVEE TSPPSSQLSL
7401 SAMISPSPVS STLPASSHSS SASVTSPLTP GQVKTTBVLD ASABPETSSP
7451 PSLSSTSVEI LATSEVTTDT EKIHPFPNTA VTKVGTSSSG HESPSSVLPD
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



7501 SETTKATSAM GTISIMGDTS VSTLTPALSN TRKIQSEPAS SLTTRLRETS 
7551 TSEETSLATE ANTVLSKVST GATTEVSRTE AISFSRTSMS GPEQSTMSQD 
7601 ISIGTIPRIS ASSVLTESAK MTITTQTGPS ESTLESTLNL NTATTPSWVE 
7651 THSIVIQGFP HPEMTTSMGR GPGGVSWPSP PFVKETSPPS SPLSLPAVTS 
7701 PHFVSTTPLA HIPPSPLPVT SLLTSGPATT TDILGTSTEP GTSSSSSLST 
7751 TSHERLTTYK DTAHTEAVHP STNTGGTNVA TTSSGYKSQS SVLADSSPMC 
7801 TTSTMGDTSV LTSTPAFLET RRIQTELASS LTPGLRESSG SEGTSSGTKM 
7851 STVLSKVPT6 ATTEISKEDV TSIPGPAQST ISPDTSTRTV SWFSTSPVMT 
7901 ESAEITMNTH TSPLGATTQG TSTLDTSSTT SLTMTHSTIS QGFSHSQMST 
7951 LMRRGPEDVS WMSPPLLEKT RPSFSLMSSP ATTSPSPVSS TLPESISSSP 
8001 LPVTSLLTSG LAKTTDMLHK SSEPVTNSPA NLSSTSVEIL ATSEVTTDTE 
8051 KTHPSSNRTV TDVGTSSSGH ESTSFVLADS QTSKVTSPMV ITSTMEDTSV 
8101 STSTPGFFET SRIQTEPTSS LTLGLRKTSS SEGTSLATEM STVLSGVPTG 
8151 ATAEVSRTEV TSSSRTSIS6 FAQLTVSPET STETITRLPT SSIMTESAEM 
8201 MIKTQTDPPG STPBSTHTVD ISTTPNWVET HSTVTQRFSH SBMTTLVSRS 
8251 PODMLWPSQS SVBETSSASS LLSLPATTSP SPVSSTLVBD FPSASLPVTS 
8301 LLTPGLVITT DRMGISREP6 TSSTSNLSST SHERLTTLED TVDTBAMQPS 
8351 THTAVTNVRT SISGHESQSS VLSDSETPKA TSSMGTTYTM (ffiTSVSISTS 
8401 DFFETSRVQI EPTSSLTSGL RBTSSSERIS SATE6STVLS BVPSGATTBV 
8451 SRTBVISSRG TSMSGPDQFT ISPDISTEAI TRLSTSPIMT BSAESAITIE 
8501 TGSPGATSEQ TLTLDTSTTT FWSCTHSTAS PGFSHSEMTT LMSRTPODVP 
8551 WPSLPSVEEA SSVSSSLSSP AMTSTSFFSA LPESISSSPH PVTALLTLGP 
8601 VKTTDMLRTS SEPETSSPPN LSSTSAEILA TSE^7TKDREK IHPSSNTPW 
8651 NVGTVIYKHL SPSSVLADLV TTKPTSPMAT TSTLGNTSVS TSTPAFPETM 
8701 MTQPTSSLTS GLREISTSQE TSSATERSAS LSGMPTGATT KVSRTEALSL 
Human Protein of CA125 Molecule
(SEQ ID NO: 5)

8751 GRTSTPGPAQ STISPEISTE TITRISTPLT TTGSAEMTIT PKTGHSGASS 

8801 QGTFTLDTSS RASWPGTHSA ATHRSPHSCM TTPMSRGPED VSWPSRPSVE 

8851 KTSPPSSLVS LSAVTSPSPL YSTPSESSHS SPLRVTSLFT PVMMKTTDML 

8901 DTSLEPVTTS PPSMNITSDE SLATSKATME TEAIQLSENT AVTQMGTISA 

8951 RQEFYSSYPG LPEPSKVTSP WTSSTIKDI VSTTIPASSE ITRIEMESTS 

9001 TLTPTPRETS TSQEIHSATK PSTVPYKALT SATIEDSMTQ VMSSSRGPSP 

9051 DQSTMSQDIS TEVITRLSTS PIKAESTEMT ITTQTGSPGA TSRGTLTLDT

9101 STTFMSGTHS TASQGFSHSQ MTALMSRTPG DVPWLSHPSV EEASSASFSL 

9151 SSPVMTSSSP VSSTLPDSIH SSSLPVTSLL TSGLVKTTBL LGTSSEPETS 

9201 SPPNLSSTSA EILATTEVTT DTEKLEMTNV VTSGYTHESP SSVLADSVTT 

9251 KATSSMGITY PTGDTNVLTS TPAFSDTSRI QTKSKLSLTP GLMETSISEE 

9301 TSSATEKSTV LSSVPT6ATT EVSRTEAISS SRTSIPGPAQ STMSSDTSMB 

9351 TITRISTPLT RKESTDMAIT PKTGPSGATS QGTFTLDSSS TASWPGTHSA 

9401 TTQRFPQSW TTPMSRGPED VSWPSPLSVE KNSPPSSLVS SSSVTSPSPL 

9451 YSTPSGSSHS SPVPVTSLFT SIMMKATDML DASLEPETTS APNMNITSDE 

9501 SLATSKATTE TEAIHVFENT AASHVETTSA TEELYSSSPG FSEPTKVISP 

9551 WTSSSIRDN MVSTTMPGSS GITRIEIESM SSLTPGLRET RTSQDITSST 

9601 ETSTVLYKMS SGATPEVSRT EVMPSSRTSI PGPAQSTMSL DISDEWTRL 

9651 STSPIMTESA EITITTQTGY SLATSQVTLP LGTSMTFLSG THSTMSQGLS 

9701 HSEMTNLMSR GPESLSWTSP RFVETTRSSS SLTSLPLTTS LSPVSSTLLD 

9751 SSPSSPLPVT SLILPGLVKT TEVLDTSSEP KTSSSPNLSS TSVEIPATSE 

9801 IMTDTEKIHP SSNTAVAKVR TSSSVHESHS SVLADSETTI TIPSMGITSA 

9851 VDDTTVFTSN PAFSETRRIP TEPTFSLTPG FRETSTSEET TSITETSAVL 

9901 YGVPTSATTE VSMTEIMSSN RTHIPDSDQS TMSPDIITEV ITRLSSSSMM 

9951 SESTQMTITT QKSSPGATAQ STLTLATTTA PLARTHSTVP PRFLHSEMTT 
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



10001 LMSRSPENPS WKSSPFVEKT SSSSSLLSLP VTTSPSVSST LPQSIPSSSF
10051 SVTSLLTPGM VKTTDTSTBP GTSLSPNLSG TSVEILAASE VTTDTEKIHP
10101 SSSMAVTNVG TTSSGHELYS SVSIHSEPSK ATYPVGTPSS MAETSISTSM
10151 PANFETTGFE AEPFSHLTSG FRKTNMSLDT SSVTPTNTPS SPGSTHLLQS
10201 SKTDFTSSAK TSSPDWPPAS QYTEIPVDII TPFNASPSIT ESTGITSFPE
10251 SRFTMSVTES THHLSTDLLP SAETISTGTV MPSLSEAMTS FATTGVPRAI
10301 SGSGSPFSRT ESGPGDATLS TIAESLPSST PVPFSSSTFT TTDSSTIPAL
10351 HEITSSSATP YRVDTSLGTE SSTTEGRLVM VSTLDTSSQP GRTSSTPILD
10401 TRMTESVBLG TVTSAYQVPS LSTRLTRTDG IMEHITKIPN EAAHRGTIRP
10451 VKGPQTSTSP ASPKGLHTGG TKRMETTTTA LKTTTTALKT TSRATLTTSV
10501 YTPTLGTLTP LNASRQMAST ILTEMMITTP YVFPDVPETT SSLATSLGAE
10551 TSTALPRTTP SVLNRESETT ASLVSRSGAE RSPVIQTLDV SSSEPDTTAS
10601 WVIHPAETIP TVSKTTPNPP HSELDTVSST ATSHGADVSS AIPTNISPSE
10651 LDALTPLVTI SGTDTSTTFP TLTKSPHETE TRTTWLTHPA ETSSTIPRTI
10701 PNFSHHESDA TPSIATSPGA ETSSAIPIMT VSPGAEDLVT SQVTSSGTDR
10751 NMTIPTLTLS PGEPKTIASL VTHPBAQTSS AIPTSTISPA VSRLVTSMVT
10801 SLAAKTSTTN RALTNSPGEP ATTVSLVTHP AQTSPTVPWT TSIFFHSKSD
10851 TTPSMTTSHG AESSSAVPTP TVSTEVPGW TPLVTSSRAV ISTTIPILTL
10901 SPGEPBTTPS MATSHGEBAS SAIPTPTVSP GVPGWTSLV TSSRAVTSTT
10951 IPILTFSLGE PETTPSMATS HGTEAGSAVP TVLPEVPGMV TSLVASSRAV
11001 TSTTLPTLTL SPGEPETTPS MATSHGAEAS STVPTVSPEV PGWTSLVTS
11051 SSGVNSTSIP TLILSPGELE TTPSMATSHG AEASSAVPTP TVSPGVSGW
11101 TPLVTSSRAV TSTTIPILTL SSSEPETTPS MATSHGVEAS SAVLTVSPEV
11151 PGMVTSLVTS SRAVTSTTIP TLTISSDEPE TTTSLVTHSE AKMISAIPTL
11201 AVSPTVQGLV TSLVTSSGSE TSAFSNLTVA SSQPETIDSW VAHPGTEASS
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



11251 WPTLTVSTG EPFTNISLVT HPAESSSTLP RTTSRFSHSE LDTMPSTVTS
11301 PEAESSSAIS TTISPGIPGV LTSLVTSSGR DISATFPTVP ESPHESEATA
11351 SWVTHPAVTS TTVPRTTPNY SHSEPDTTPS lATSPGAEAT SDFPTITVSP
11401 DVPDMVTSQV TSSGTDTSIT IPTLTLSSGE PETTTSFITY SETHTSSAIP
11451 TLPVSPGASK MLTSLVISSG TDSTTTFPTL TETPYEPETT AIQLIHPAET
11501 NTMVPRTTPK FSHSKSDTTL PVAITSPGPE ASSAVSTTTI SPDMSDLVTS
11551 LVPS^GTDTS TTFPTLSETP YEPETTATWL THPAETSTTV SGTIPNFSHR
11601 GSDTAPSMVT SPGVDTRSGV PTTTIPPSIP GWTSQVTSS ATDTSTAIPT
11651 LTPSPGEPET TASSATHPGT QTGFTVPIRT VPSSEPDTMA SWVTHPPQTS
11701 TPVSRTTSSF SHSSPDATPV MATSPRTEAS SAVLTTISPG APEMVTSQIT
11751 SSGAATSTTV PTLTHSPGMP ETTALLSTHP RTETSKTFPA STVFPQVSET
11801 TASLTIRPGA ETSTALPTQT TSSLFTLLVT GTSRVDLSPT ASPGVSAKTA
11851 PLSTHPGTET STMIPTSTLS LGLLETTGLL ATSSSAETST STLTLTVSPA
11901 VSGLSSASIT TDKPQTVTSW NTETSPSVTS VGPPEFSRTV TGTTMTLIPS
11951 EMPTPPKTSH GE6VSPTTIL RTTMVEATNL ATTGSSPTVA KTTTTFNTLA
12001 GSLFTPLTTP GMSTLASESV TSRTSYNHRS WISTTSSYNR RYWTPATSTP
12051 VTSTFSPGIS TSSIPSSTAA TVPFMVPFTL NFTITNLQYE EDMRHPGSRK
12101 FNATERELQG LLKPLFRNSS LEYLYSGCRL ASLRPEKDSS AMAVDAICTH
12151 RPDPEDLGLD RERLYWBLSN LTNGIQELGP YTLDRNSLYV NGFTHRSSMP
12201 TTSTPGTSTV DVGTSGTPSS SPSPTAAGPL LMPFTLNFTI TNLQYEEDMR
12251 RTGSRKFNTM ESVLQGLLKP LFKNTSVGPL YSGCRLTLLR PEKDGAATGV
12301 DAICTHRLDP KSPGLNREQL YWELSKLTND lEELGPYTLD RNSLYVNGFT
12351 HQSSVSTTST PGTSTVDLRT SGTPSSLSSP TIMAAGPLLV PFTLNFTITN
12401 LQYGEDMGHP GSRKFNTTER VLQGLLGPIF KNTSVGPLYS GCRLTSLRSE
12451 KDGAATGVDA ICIHHLDPKS P6LNRERLYW ELSQLTNGIK ELGPYTLDRN
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



12501 SLYVNGFTHR TSVPTSSTPG TSTVDLGTSG TPFSLPSPAT AGPLLVLFTL 

12551 NFTITNLKYE EDMHRPGSRK FNTTERVLQT LLGPMFKNTS VGLLYSGCRL 

12601 TLLRSEKDGA ATGVDAICTH RLDPKSPGLD REQLYWELSQ LTNGIBOBLGP 

12651 YTLDRNSLYV NGFTHWIPVP TSSTPGTSTV DLGSGTPSSL PSPTAAGPLL 

12701 VPFTLNFTIT NLQYEEDMHH PGSRKFNTTE RVLQGLLGPM FKNTSVGLLY 

12751 SGCRLTLLRS EKDGAATGVD AICTHRLDPK SPGVDREQLY WELSQLTNGI

12801 KELGPYTLDR NSLYVNGFTH QTSAPNTSTP GTSTVDLGTS GTPSSLPSPT 

12851 SAGPLLVPFT LNFTITNLQY EEDMRHPGSR KFNTTERVLQ GLLKPLFKST 

12901 SVGPLYSGCR LTLLRSEKDG AATGVDAICT HRLDPKSPGV DREQLYWELS 

12951 QLTNGIKELG PYTLDRNSLY VNGPTHQTSA PNTSTPGTST VDLGTSGTPS 

13001 SLPSPTSAGP LLVPFTLNFT ITNLQYEEDM HHPGSRKFNT TERVLQGLLG 

13051 PMFKNTSVGL LYSGCRLTLL RPEKNGAATG MDAICSHRLD PKSPGLNREQ 

13101 LYWELSQLTH GIKBLGPYTL DRNSLYVNGF THRSSVAPTS TPGTSTVDLG 

13151 TSGTPSSLPS PTTAVPLLVP FTLNFTITNL QYGEDMRHPG SRKFNTTERV 

13201 LQGLLGPLFK NSSVGPLYSG CRLISLRSEK Z/GAATCVDAI CTHHLNPQSP 

13251 GLDREQLYWQ LSQMTNGIKE LGPYTLDRNS LYVNGFTHRS SGLTTSTPWT 

13301 STVDLGTSGT PSPVPSPTTA GPLLVPFTLN FTITNLQYEE DMHRPGSRKF 

13351 NATERVLQGL LSPIFKNSSV GPLYSGCRLT SLRPEKDGAA TGMDAVCLYH 

13401 PNPKRPGLDR EQLYWELSQL THNITELGPY SLDRDSLYVN GFTHQNSVPT 

13451 TSTPGTSTVY WATTGTPSSF PGPTTEPGPLL IPFTFNFTIT NLHYEENMQH 

13501 PGSRKFNTTE RVLQGLLKPL FKNTSVGPLY SGCRLTSLRP EKDGAATGMD

13551 AVCLYHPNPK RPGLDREQLY CELSQLTHNI TBLGPYSLDR DSLYVNGFTH 

13601 QNSVPTTSTP GTSTVYWATT GTPSSFPGHT EPGPLLIPFT FNFTITNLHY 

13651 EENMQHPGSR KFNTTERVLQ GLLKPLFKNT SVGPLYSGCR LTLLRPEKHE 

13701 AATGVDTICT HRVDPIGPGL DRERLYWELS QLTNSITELG PYTLDRDSLY
Human Protein of CA125 Molecule

13751 VNGFNPRSSV PTTSTPGTST VHLATSGTPS SLPGHTAPVP LLIPFTLNFT

13801 ITNLHYEENM QHPGSRKFNT TERVLQGLLK PLFKNTSVGP LYSGCRLTLL 

13851 RPEKHEAATG VDTICTHRVD PIGPGLDREX LYWELSXLTX XIXELGPYXL

13901 DRXSLYVNGF XXXXXXXXTS TPGTSXVXLX TSGTPXXXPX XTSAGPLLVP

13951 FTLNFTITNL QYEEDMHHPG SRKFNTTERV LQGLLGPMFK NTSVGLLYSG

14001 CRLTLLRPEK NGAATGMDAI CSHRLDPKSP GLDREQLYWE LSQLTHGIKE 

14051 LGPYTLDRNS LYVNGFTHRS SVAPTSTPGT STVDLGTSGT PSSLPSPTTA 

14101 VPLLVPFTLN FTITNLQYGE DMRHPGSRKF NTTERVLQGL LGPLFKNSSV 

14151 GPLYSGCRLI SLRSEKDGAA TGVDAICTHH LNPQSPGLDR EQLYWQLSQM 

14201 TNGIKELGPY TLDRNSLYVN GFTHRSSGLT TSTPWTSTVD LGTSGTPSPV 

14251 PSPTTAGPLL VPFTLNFTIT NLQYEEDMHR PGSRKFNATE RVLQGLLSPI

14301 FKNSSVGPLY SGCRLTSLRP EKDGAATGMD AVCLYHPNPK RPGLDREQLY

14351 WELSQLTHNI TELGPYSLDR DSLYVNGFTH QSSMTTTRTP DTSTMHLATS 

14401 RTPASLSGPT TASPLLVLFT INCTITNLQY EEDMRRTGSR KFNTMESVLQ 

14451 GLLKPLFKNT SVGPLYSGCR LTLLRPKKDG AATGVDAICT HRLDPKSPGL 

14501 NREQLYWELS KLTNDIEELG PYTLDRNSLY VNGFTHQSSV STTSTPGTST 

14551 VDLRTSGTPS SLSSPTIMXX XPLLXPFTLN FTITNLXYEE XMXXPGSRKF 

14601 NTTERVLQGL LRPLFKNTSV SSLYSGCRLT LLRPBKDGAA TRVDAACTYR 

14651 PDPKSPGLDR EQLYWELSQL THSITELGPY TLDRVSLYVN GFNPRSSVPT 

14701 TSTPGTSTVH LATSGTPSSL PGHTXXXPLL XPFTLNFTIT NLXYEEXMXX 

14751 PGSRKFNTTE RVLQGLLKPL FRNSSLEYLY SGCRLASLRP EKDSSAMAVD 

14801 AICTHRPDPE DLGLDRERLY WELSNLTNGI QELGPYTLDR NSLYVNGFTH 

14851 RSSFLTTSTP WTSTVDLGTS GTPSPVPSPT TAGPLLVPFT LNFTITNLQY 

14901 EEDMHRPGSR RFNTTERVLQ GLLTPLFKNT SVGPLYSGCR LTLLRPBKQB 

14951 AATGVDTICT HRVDPIGPGL DRERLYWBLS QLTNSITELG PYTLDRDSLY 
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



15001 VNGFNPWSSV PTTSTPGTST VHLATSGTPS SLPGHTAPVP LLIPFTLNFT 
15051 ITDLHYEBNM QHPGSRKFNT TBRVLQGLLK PLPKSTSVGP LYSGCRLTLL 
15101 RPEKHGAATG VDAICTLRLD PT6PGLDRBR LYWELSQLTN SVTBLGPYTL 
15151 DRDSLYVNGF THRSSVPTTS IPGTSAVHLE TSGTPASLPG HTAPGPLLVP 
15201 FTLNPTITNL QYEBDMRHPG SRKFSTTERV LQGLLKPLFK NTSVSSLYSG 
15251 CRLTLLRPBK DGAATRVDAV CTHRPDPKSP GLDRERLYWK LSQLTHGITE 
15301 LGPYTLDRHS LYVNGFTHQS SMTTTRTPDT STMHLATSRT PASIiSGPTTA 
15351 SPLLVLFTIN FTITNORYBE NMHHPGSRKF NTTERVLQGL LRPVFKNTSV 
15401 GPLYSGCRLT LLRPKKDGAA TKVDAICTYR PDPKSPGLDR EQLYWELSQL 
15451 THSITELGPY TQDRDSLYVN GFTHRSSVPT TSIPGTSAVH LETSGTPASL 
15501 P6HTAPGPLL VPFTLNFTIT NLQYEEDMRH PGSRKFNTTE RVLQGLLKPL 
15551 FKSTSVGPLY SGCRLTLLRP BKRGAATGVD TICTHRLDPL NPGLDRBQLY 
15601 WELSKLTRGI lELGPYLLDR GSLYVNGFTH RTSVPTTSTP GTSTVDLGTS 
15651 GTPFSLPSPA XXXPLLXPFT LNFTITNUCY EBXMXXPGSR KFNTTERVLQ 
15701 TLLGPMFKNT SVGLLYSGCR LTLLRSBKDG AATGVDAICT HRLDPKSPGV 
15751 DRBQLYWELS QLTNGIKELG PYTLDRNSLY VNGFTHWIPV PTSSTPGTST 
15801 VDLGSGTPSL PSSPTTAGPL LVPFTLNPTI TNLKYEEDMH CPGSRKFNTT 
15851 ERVLQSLLGP MFKNTSVGPL YSGCSILTLLR SEKDGAATGV DMCTHRM)? 
15901 KSPGVDRBQL YWELSQLTNG IKELGPYTLD RNSLYVNGFT HQTSAPNTST 
15951 P6TSTVDL6T SGTPSSLPSP TXXXPLLXPF TLNFTITNLX YEEXMXXPGS 
16001 RKFNTTERVL QGLLXPXPKX TSVGXLYSGC RLTLLRXBKX XAATXVDXXC 
16051 XXXXDPXXPG LDREXLYWEL SXLTXXIXEL GPYXLDRXSL YVHGFTHWIP 
16101 VPTSSTPGTS TVDLGSGTPS SLPSPTTAGP LLVPFTLNFT ITNLKYEBDM
16151 HCPGSRKFNT TERVLQSLLG PMFKNTSVGP LYSGCRLTSL RSEKDGAATG 
16201 VDAICTHRVD PKSPGVDRBQ LYWELSQLTN GIKBL6PYTL DRNSLYVNGP 
Human Protein of CA125 Molecule
(SEQ ID NO: 5)



16251 THQTSAPNTS TPGTSTVDLG TSGTPSSLPS PTSAGPLLVP FTLNFTITNL
16301 QYEEDMHHPG SRKFNTTERV LQGLLGPMFK NTSVGLLYSG CRLTLLRPEK
16351 NGAATGMDAI CTHRLDPKSP GLDREXLYWE LSXLTXXIXB LGPYXLDRXS
16401 LYVNGFXXXX XXXXTSTPGT SXVXLXTSGT PXXXPXXTXX XPLLXPFTLN
16451 FTITNLXYEE XMXXPGSRKF JJTTERVLQGL LKPLFRNSSL EYLYSGCRLA
16501 SLRPEKDSSA MAVDAICTHR PDPEDLGLDR ERLYWELSNL TNGIQELGPY
16551 TLDRNSLYVN GFTHRSSMPT TSTPGTSTVD VGTSGTPSSS PSPTTAGPLL
16601 IPFTLNFTIT NLQYGEDMGH PGSRKFNTTE RVLQGLLGPI FKNTSVGPLY
16651 SGCRLTSLRS EKDGAATGVD AICIHHLDPK SPGLNRERLY WELSQLTNGI
16701 KELGPYTLDR NSLYVNGFTH RTSVPTTSTP GTSTVDLGTS GTPFSLPSPA
16751 TAGPLLVLFT LNFTITNLKY EEDMHRPGSR KFNTTERVLQ TLLGPMFKNT
16801 SVGLLYSGCR LTLLRSEKDG AATGVDAICT HRLDPKSPGL DREXLYWELS
16851 XLTXXIXELG PYXLDRXSLY VNGFXXXXXX XXTSTPGtSX VXLXTSGTPX
16901 XXPXXTXXXP LLXPFTLNFT ITNLXYEEXM XXPGSRKFNT TERVIiQGLLR
16951 PVFKNTSVGP LYSGCRLTLL RPKKDGAATK VDAICTYRPD PKSP6LDREQ
17001 LYWELSQLTH SITELGPYTQ DRDSLYVNGF THRSSVPTTS IPGTSAVHLE
17051 TTGTPSSFPG HTEPGPLLIP FTFNFTITNL RYEEI^QHPG SRKFNTTERV
17101 LQGLLTPLFK NTSVGPLYSG CRLTLLRPBK QEAATGVDTI CTHRVDPIGP
17151 GIJDRERLYWE LSQLTNSITE LGPYTLDRDS LYVDGFNPWS SVPTTSTPGT
17201 STVHLATSGT PSPLPGHTAP VPLLIPFTLN FTITDLHYEE NMQHPGSRKF
17251 NTTERVLQGL LKPLFKSTSV GPLYSGCRLT LLRPEKHGAA TGVDAICTLR
17301 LDPTGPGLDR ERLYWELSQL TNSITELGPY TLDRDSLYVN GFNPWSSVPT
17351 TSTPGTSTVH LATSGTPSSL PGHTTAGPLL VPFTLNFTIT NLKYEEDMHC
17401 PGSRKFNTTE RVLQSLHGPM FKNTSVGPLY SGCRLTLLRS EKDGAATGVD
17451 AICTHRLDPK SPGLDREXLY l^ELSXLTXXI XELGPYXLDR XSLYVNGFXX
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



17501 XXXXXXTSTP GTSXVXLXTS GTPXXXPXXT XXXPLLXPFT LNFTITNLXY 

17551 EEXMXXPGSR KPNTTERVLQ GLLXPXFKXT SVGXLYSGCR LTLLRXEKXX 

17601 AATXVDXXCX XXXDPXXPGL DREXLYWELS XLTNSITELG PYTLDRDSLY 

17651 VNGPTHRSSM PTTSIPGTSA VHLETSGTPA SLPGHTAPGP LLVPFTLNFT 

17701 ITNLQYEEDM RHPGSRKFNT TERVLQGLLK PLFKSTSVGP LYSGCRLTLL 

17751 RPEKRGAATG VDTICTHRLD PLNPGLDREX LYWELSXLTX XIXELGPYXL 

17801 DRXSLYVNGF XXXXXXXXTS TPGTSXVXLX TSGTPXXXPX XTXXXPLLXP 

17851 FTLNFTITNL XYEEXMXXPG SRKFNTTERV LQGLLXPXFK XTSVGXLYSG 

17901 CRLTLLRXEK XXAATXVDXX CXXXXDPXXP GLDREXLYWE LSXLTXXIXE 

17951 LGPYXLDRXS LYVNGFHPRS SVPTTSTPGT STVHLATSGT PSSLPGHTAP 

18001 VPLLIPFTLN FTITNLHYEE NMQHP6SRKF NTTERVLQGL LGPMFKNTSV 

18051 GLLYSGCRLT LLRPEKNGAA TGMDAICSHR LDPKSPGLDR EXLYWELSXL 

18101 TXXIXBLGPY XLDRXSLYVN GFXXXXXXXX TSTPGTSXVX LXTSGTPXXX 

18151 PXXTXXXPLL XPFTLNFTIT NLXYEEXMXX PGSRKFNTTE RVLQGLLXPX 

18201 FKXTSVGXLY SGCRLTLLRX EKXXAATXVD XXCXXXXDPX XPGLDREXLY 

18251 WELSXLTXXI XELGPYXLDR XSLYVNGFTH QNSVPTTSTP GTSTVYWATT 

18301 GTPSSFPGHT EPGPLLIPFT FNFTITNLHY EENMQHPGSR KFNTTERVLQ 

18351 GLLTPLFKNT SVGPLYSGCR LTLLRPEKQE AATGVDTICT HRVDPIGPGL 

18401 DREXLYWELS XLTXXIXELG PYXLDRXSLY VNGFXXXXXX XXTSTPGTSX 

18451 VXLXTSGTPX XXPXXTXXXP LLXPFTLNFT ITNLXYEEXM XXPGSRKFNT 

18501 TERVLQGLLX PXFKXTSVGX LYSGCRLTLL RXEKXXAATX VDXXCXXXXD 

18551 PXXPGLDREX LYWELSXLTX XIXELGPYXL DRXSLYVNGF THRSSVPTTS 

18601 SPGTSTVHLA TSGTPSSLPG HTAPVPLLIP FTLNFTITNL HYEENMQHPG 

18651 SRKFNTTERV LQGLLKPLFK STSVGPLYSG CRLTLLRPEK HGAATGVDAI 

18701 CTLRLDPTGP GLDREXLYWE LSXLTXXIXE LGPYXLDRXS LYVNGFXXXX 
Table 5 (continued)

Human Protein of CA125 Molecule
(SEQ ID NO: 5)



18751 XXXXTSTPGT SXVXLXTSGT PXXXPXXTXX XPLLXPFTLN FTITNLXYEE 

18801 XMXXPGSRKF NTTERVLQGL LXPXFKXTSV GXLYSGCRLT LLRXEKXXAA 

18851 TXVDXXCXXX XDPXXPGLDR EXLYWELSXL TXXIXELGPY XLDRXSLYVN 

18901 GFTHRTSVPT TSTPGTSTVH LATSGTPSSL PGHTAPVPLL IPFTLNFTIT 

18951 NLQYEEDMHR PGSRKFNTTE RVLQGLLSPI FKNSSVGPLY SGCRLTSIiRP 

19001 EKDGAATGMD AVCLYHPNPK RPGLDREQLY CELSQLTHNI TELGPYSLDR 

19051 DSLYVNGFTH QNSVPTTSTP GTSTVYWATT GTPSSFPGHT XXXPLLXPFT 

19101 LNFTITNLXY EEXMXXPGSR KFNTTERVLQ GLLXPXFKXT SVGXLYSGCR 

19151 LTLLRXEKXX AATXVDXXCX XXXDPXXPGL DREXLYWELS XLTXXIXELG 

19201 PYXLDRXSLY VNGFTHWSSG LTTSTPWTST VDLGTSGTPS PVPSPTTAGP 

19251 LLVPFTLNFT ITNLQYEEDM HRPGSRKFNA TERVLQGLLS PIPKNTSVGP 

19301 LYSGCRLTLL RPEKQEAATG VDTICTHRVD PIGPGLDREX LYWELSXLTX 

19351 XIXELGPYXL DRXSLYVNGF XXXXXXXXTS TPGTSXVXLX TSGTPXXXPX

19401 XTXXXPLLXP FTLNFTITNL XYEEXMXXPG SRKFNTTERV LQGLLXPXFK

19451 XTSVGXLYSG CRLTLLRXEK XXAATXVDXX CXXXXDPXXP GLDREXLYWE 

19501 LSXLTXXIXE LGPYXLDRXS LYVNGFTHRS FGLTTSTPWT STVDLGTSGT 

19551 PSPVPSPTTA GPLLVPFTLN FTITNLQYEE DMHRPGSRKF NTTERVLQGL

19601 LTPLFRNTSV SSLYSGCRLT LLRPEKDGAA TRVDAVCTHR PDPKSPGLDR 

19651 EXLYWELSXL TXXIXELGPY XLDRXSLYVN GFXXXXXXXX TSTPGTSXVX 

19701 LXTSGTPXXX PXXTXXXPLL XPFTLNFTIT NLXYEEXMXX PGSRKFNTTE 

19751 RVLQGLLXPX FKXTSVGXLY SGCRLTLLRX EKXXAATXVD XXCXXXXDPX 

19801 XPGLDREXLY WELSXLTXXI XELGPYXLDR XSLYVNGFTH WIPVPTSSTP 

19851 GTSTVDLGSG TPSSLPSPTT AGPLLVPFTL NFTITNLQYG EDMGHPGSRK 

19901 FNTTERVLQG LLGPIFKNTS VGPLYSGCRL TSLRSEKDGA ATGVDAICIH

19951 HLDPKSPGLD REXLYWELSX LTXXIXELGP YXLDRXSLYV NGFXXXXXXX 

Table 5 (continued) 


Human Protein of CA125 Molecule 
(SEQ ID NO: 5)



20001 XTSTPGTSXV XLXTSGTPXX XPXXTXXXPL LXPFTLNFTI TNLXYEEXMX

20051 XPGSRKFNTT ERVLQGLLXP XFKXTSVGXL YSGCRLTLLR XEKXXAATXV

20101 DXXCXXXXDP XXPGLDREXL YWELSXLTXX IXELGPYXLD RXSLYVNGFT

20151 HQTFAPNTST PGTSTVDLGT SGTPSSLPSP TSAGPLLVPF TLNFTITNLQ

20201 YEEDMHHPGS RKFNTTERVL QGLLGPMFKN TSVGLLYSGC RLTLLRPEKN

20251 GAATRVDAVC THRPDPKSPG LDREXLYWEL SXLTXXIXEL GPYXLDRXSL

20301 YVNGFXXXXX XXXTSTPGTS XVXLXTSGTP XXXPXXTAPV PLLIPFTLNF

20351 TITNLHYEEN MQHPGSRKFN TTERVLQGLL RPLFKSTSVG PLYSGCRLTL

20401 LRPEKHGAAT GVDAICTLRL DPTGPGLDRE RLYWELSQLT NSVTELGPYT

20451 LDRDSLYVNG FTQRSSVPTT SIPGTSAVHL ETSGTPASLP GHTAPGPLLV

20501 PFTLNFTITN LQYEVDMRHP GSRKFNTTER VLQGLLKPLF KSTSVGPLYS

20551 GCRLTLLRPE KRGAATGVDT ICTHRLDPLN PGLDREQLYW ELSKLTRGII

20601 ELGPYLLDRG SLYVNGFTHR NFVPITSTPG TSTVHLGTSE TPSSLPRPIV

20651 PGPLLVPFTL NFTITNLQYE EAMRHPGSRK FNTTERVLQG LLRPLFKNTS

20701 IGPLYSSCRL TLLRPEKDKA ATRVDAICTH HPDPQSPGLN REQLYWELSQ

20751 LTHGITELGP YTLDRDSLYV DGFTHWSPIP TTSTPGTSIV NLGTSGIPPS

20801 LPETTXXXPL LXPFTLNFTI TNLXYEEXMX XPGSRKFNTT ERVLQGLLKP

20851 LFKSTSVGPL YSGCRLTLLR PEKDGVATRV DAICTHRPDP KIPGLDRQQL

20901 YWELSQLTHS ITELGPYTLD RDSLYVNGFT QRSSVPTTST PGTFTVQPET

20951 SETPSSLPGP TATGPVLLPF TLNFTITNLQ YEEDMHRPGS RKFNTTERVL

21001 QGLLMPLFKN TSVSSLYSGC RLTLLRPEKD GAATRVDAVC THRPDPKSPG

21051 LDRERLYWKL SQLTHGITEL GPYTLDRHSL YVNGFTHQSS MTTTRTPDTS

21101 TMHLATSRTP ASLSGPTTAS PLLVLFTINF TITNLRYEEN MHHPGSRKFN

21151 TTERVLQGLL RPVFKNTSVG PLYSGCRLTL LRPKKDGAAT KVDAICTYRP

21201 DPKSPGLDRE QLYWELSQLT HSITELGPYT QDRDSLYNVG FTQRSSVPTT

21251 SVPGTPTVDL GTSGTPVSKP GPSAASPLLV LFTLNGTITN LRYEENMQHP

21301 GSRKFNTTER VLQGLLRSLF KSTSVGPLYS GCRLTLLRPE KDGTATGVDA

21351 ICTHHPDPKS PRLDREQLYW ELSQLTHNIT ELGHYALDND SLFVNGFTHR

21401 SSVSTTSTPG TPTVYLGASK TPASIFGPSA ASHLLILFTL NFTITNLRYE

21451 ENMWPGSRKF NTTERVLQGL LRPLFKNTSV GPLYSGSRLT LLRPEKDGEA

21501 TGVDAICTHR PDPTGPGLDR EQLYLELSQL THSITELGPY TLDRDSLYVN

21551 GFTHRSSVPT TSTGWSEEP PTLNFTINNL RYMADMGQPG SLKFNITDNV

21601 MKHLLSPLFQ RSSLGARYTG CRVIALRSVK NGAETRVDLL CTYLQPLSGP

21651 GLPIKQVFHE LSQQTHGITR LGPYSLDKDS LYLNGYNEPG LDEPPTTPKP

21701 ATTFLPPLSE ATTAMGYHLK TLTLNFTISN LQYSPDMGKG SATFNSTEGV

21751 LQHLLRPLFQ KSSMGPFYLG CQLISLRPEK DGAATGVDTT CTYHPDPVGP

21801 GLDIQQLYWE LSQLTHGVTQ LGFYVLDRDS LPINGYAPQN LSIRGEYQIN

21851 FHIVNWNLSN PDPTSSEYIT LLRDIQDKVT TLYKGSQLHD TFRFCLVTNL

21901 TMDSVLVTVK ALFSSNLDPS LVEQVFLDKT LNASFHWLGS TYQLVDIHVT

21951 EMESSVYQPT SSSSTQHFYL NFTITNLPYS QDKAQPGTTN YQRNKRNIED

22001 ALNQLFRNSS IKSYFSDCQV STFRSVPNRH HTGVDSLCNF SPLARRVDRV

22051 AIYBEPLRMT RNGTQLQNFT LDRSSVLVDG YSPNRNEPLT GNSDLPFWAV

22101 ILIGLAGLLG LITCLICGVL VTTRRRKKEG EYNVQQQCPG YYQSHLDLED

22151 LQ

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WE CLAIM 

1. An isolated nucleic acid molecule encoding CA125.

2. The isolated nucleic acid molecule of claim 1 comprising the sequence of SEQ ID NO: 4. 

3. The isolated nucleic acid molecule of claim 2 wherein the sequence has at least about 70% 
homology with SEQ ID NO: 4. 

4. The isolated nucleic acid molecule of claim 2 wherein said molecule is a fragment thereof. 

5. An isolated nucleic acid molecule comprising the sequence shown in SEQ ID NO: 1.

6. The isolated nucleic acid molecule of claim 5 wherein the sequence has at least about 70%
homology with SEQ ID NO: 1.

7. The isolated nucleic acid molecule of claim 5 wherein said molecule is a fragment thereof. 

8. An isolated nucleic acid molecule comprising the sequence shown in SEQ ID NO: 2. 

9. The isolated nucleic acid molecule of claim 8 wherein the sequence has at least about 70%
homology with SEQ ID NO: 2. 

10. The isolated nucleic acid molecule of claim 8 wherein said molecule is a fragment thereof.

11. An isolated nucleic acid molecule comprising the sequence shown in SEQ ID NO: 3.

12. The isolated nucleic acid molecule of claim 11 wherein the sequence has at least about 70%
homology with SEQ ID NO: 3.

13. The isolated nucleic acid molecule of claim 11 wherein said molecule is a fragment thereof.

14. A polypeptide with the amino acid sequence selected from the group consisting of: (a) the 
amino acid sequence set forth in SEQ ID NO: 5; (b) an amino acid sequence having at least 50% 
sequence identity to said sequence; (c) a conservative variant of any one of (a) to (b); and (d) a
fragment of any one of (a) to (c). 

15. A purified antibody that selectively binds to an amino acid sequence of the CA125 protein: 

(a) wherein the amino acid sequence of the CA125 protein comprises the amino acid
sequence set forth in SEQ ID NO: 5; 
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(b) an amino acid sequence having at least 50% sequence identity to said sequence; 

(c) a conservative variant of any one of (a) to (b); and 

(d) a fragment of any one of (a) to (c).

16. The purified antibody of claim 15 wherein said sequence identity is at least 60%. 

17. The purified antibody of claim 15 wherein said sequence identity is at least 70%.

18. The purified antibody of claim 15 wherein said sequence identity is at least 80%.

19. The purified antibody of claim 15 wherein said sequence identity is at least 90%.

20. A method to make a purified fragment of the CA125 polypeptide of SEQ ID NO: 5
comprising: 

(a) expressing a portion of the isolated nucleic acid molecule set out in SEQ ID NO: 4 to 
obtain a fragment of the CA125 molecule; and

(b) purifying said fragment of the CA125 molecule. 
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SEQUENCE LISTING 



<110> The Board of Trustees of the University of Arkansas
O'Brien, Timothy
Beard, John
Underwood, Lowell

<120> CA125 Gene and Its Use for Diagnostic and Therapeutic
Interventions

<130> 022438.43867

<150> 60/284,175
<151> 2001-04-17

<150> 60/299,380
<151> 2001-06-19

<150> 09/965,738
<151> 2001-09-27

<150> 60/345,180
<151> 2001-12-02

<150> 60/427,045
<151> 2002-11-15

<160> 7

<170> PatentIn version 3.2

<210> 1 

<211> 57082 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<222> (2205)..(11679)

<223> Exon 1

<220>

<221> misc_feature

<222> (13464)..(13570)

<223> Exon 2

<220>

<221> misc_feature

<222> (16177)..(34419)

<223> Exon 3

<220>

<221> misc_feature

<222> (34420)..(34534)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (34575)..(38024)

<223> Exon 4 



<220> 
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<221> misc_feature 

<222> (38689)..(38800)

<223> Exon 5

<220>

<221> misc_feature

<222> (39715)..(39771)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (40578)..(45257)

<223> Exon 6

<220>

<221> misc_feature

<222> (47360)..(47395)

<223> Exon 7

<220>

<221> misc_feature

<222> (52407)..(52442)

<223> Exon 8

<220>

<221> misc_feature

<222> (52686)..(52744)

<223> Exon 9 



<400> 1 

ggtgcgcacc actatgtctg gctaattttt gtattttttt gtagagacat ggtttcacca 60

tgttggccag gctggtctcg aattcctgac ttcaagtaat ccacccacct cagcctccca 120

aagtgctggg attacaagca tgagccacca tgcatggcct aaagcttctt ttaaagccac 180

caagtccctt cccatgttag cccactaatc catgggttag tcatgaatgg attaatctat 240

tcatacggac agagccctca tcacccaatc acctcttaaa ggccccacct ctcaatactg 300

ccacactggg gattaagttt caacagagtt ttggagggga cattcaaatc atagtaatgc 360

ccaaagtgaa aaatcttccc tgcacttttc cctcaacaaa aacagccaga gatagtgagc 420

tgccaggaaa ttcttttttt tttcctcttc tgtcctaaat cagcatcgct agacctttac 480

atgattcaac ctcatcttct tcaccctctg ggtcatgaaa ttttatttat ttatttatta 540

ttttcttggg acagactctg gctctgtcgc ccaggctgaa gtgcagtggt gtgatcttgg 600

ctcactgcaa cctccgcctc ccgggttcaa gcgattctcc tgcctcagcc tcctgagtag 660

ctgggattac aggtgggcgc caccacaccc agctaatttt ttgtattttt agtagagatg 720

gggtttcacc atattagcca ggatggtctc catctcttga cctcgtgatc tgcccacctc 780

agcctcccaa aatgctggga ttacaggcat gagacaccac gcccagcagg ccagggtcat 840

gagattttaa tcaagagcaa cttccactga ttcctgagag tgcatctgtg ggcccctgct 900

ctgatctgaa cagaagtgcc gtgtcttctc tgacctccac ttctcaattc aagagcctta 960

gtatctgcca gtatcacaca ctgagcatta gctccatctc atgggggtgt aggtaggggc 1020

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tctatctgca tctttctttc tttttttctt tctttccctt cctcccttcc tcactccctc 1080 

ggtcctctct ttctttcctt ttctttcttc cttcctccct tcctccctcc ctccctctct 1140 

ctttctctct ttctttcttt ccttctttct ttctttctct cttccttccc tccctccctc 1200 

cttccttcct ttctctttct ttctctttct ttcttttttt ccttccttcc ttccttcttt 1260 

ctctttctct ccctcccttc cttccttcct tccttccttc cttcctttct ttctttcttt 1320 

ctttctttct ttctttcttt ctttctttct ttctttcttc cttccttcct tccttccttc 1380 

cttccttcct tccttccttt cttttctttc tttctctttc tttttgagac agagctctta 1440 

ttacccatgc tggagtgcag tggtgtgacc ttggcttact gcaacatctg cctcctaggg 1500 

tcaagtgatt ctcctgcctc agcctcctaa gtagctggga ttacagacac atgccaccac 1560 

acccaatatt tatttttatt aaaatttttt ttaaaattat ttttaaaaaa ttaaaaataa 1620 

ttttgtattt ttagtagaga cggggtttct ccatgttggt caggctgatc tcaaactccc 1680 

aacctcaggt gatcctccca cctcacctcc caaagtgctg ggattacagg catgagccac 1740 

cgtgcccagc ctggttcctg gtttctaaga catcacacac acacacacac acacacacac 1800 

acactcacac actcagagag agagagagag agaggatcat taagacatga tacactaaga 1860 

aattctattc tgcagacact gagaatccgt taaaaagttt gaagggaaga attgagatca 1920 

tcaggtgttt atttgaggaa attgtctgtg gttgaactat cctttccttt ctctccctga 1980 

gatttggtct tctcaattag aagcgttgca caattccccc aacctccata catacggcag 2040 

ctcttctaga cacaggtttt cccaggtcaa atgcggggac cccagccata tctcccaccc 2100 

tgagaaattt tggagtttca gggagctcag aagctctgca gaggccaccc tctctgaggg 2160 

gattcttctt agacctccat ccagaggcaa atgttgacct gtccatgctg aaaccctcag 2220 

gccttcctgg gtcatcttct cccacccgct ccttgatgac agggagcagg agcactaaag 2280 

ccacaccaga aatggattca ggactgacag gagccacctt gtcacctaag acatctacag 2340 

gtgcaatcgt ggtgacagaa catactctgc cctttacttc cccagataag accttggcca 2400 

gtcctacatc ttcggttgtg ggaagaacca cccagtcttt gggggtgatg tcctctgctc 2460 

tccctgagtc aacctctaga ggaatgacac actccgagca aagaaccagc ccatcgctga 2520 

gtccccaggt caatggaact ccctctagga actaccctgc tacaagcatg gtttcaggat 2580 

tgagttcccc aaggaccagg accagttcca cagaaggaaa ttttaccaaa gaagcatcta 2640 

catacacact cactgtagag accacaagtg gcccagtcac tgagaagtac acagtcccca 2700 

ctgagacctc aacaactgaa ggtgacagca cagagacccc ctgggacaca agatatattc 2760 

ctgtaaaaat cacatctcca atgaaaacat ttgcagattc aactgcatcc aaggaaaatg 2820 

ccccagtgtc tatgactcca gc^gagacca cagttactga ctcacatact ccaggaagga 2880

caaacccatc atttgggaca ctttattctt ccttccttga cctatcacct aaagggaccc 2940

caaattccag aggtgaaaca agcctggaac tgattctatc aaccactgga tatcccttct 3000

cctctcctga acctggctct gcaggacaca gcagaataag taccagtgcg cctttgtcat 3060

catctgcttc agttctcgat aataaaatat cagagaccag catattctca ggccagagtc 3120

tcacctcccc tctgtctcct ggggtgcccg aggccagagc cagcacaatg cccaactcag 3180

ctatcccttt ttccatgaca ctaagcaatg cagaaacaag tgccgaaagg gtcagaagca 3240

caatttcctc tctggggact ccatcaatat ccacaaagca gacagcagag actatcctta 3300

ccttccatgc cttcgctgag accatggata tacccagcac ccacatagcc aagactttgg 3360

cttcagaatg gttgggaagt ccaggtaccc ttggtggcac cagcacttca gcgctgacaa 3420

ccacatctcc atctaccact ttagtctcag aggagaccaa cacccatcac tccacgagtg 3480

gaaaggaaac agaaggaact ttgaatacat ctatgactcc acttgagacc tctgctcctg 3540

gagaagagtc cgaaatgact gccaccttgg tccccactct aggttttaca actcttgaca 3600

gcaagatcag aagtccatct caggtctctt catcccaccc aacaagagag ctcagaacca 3660

caggcagcac ctctgggagg cagagttcca gcacagctgc ccacgggagc tctgacatcc 3720

tgagggcaac cacttccagc acctcaaaag catcatcatg gaccagtgaa agcacagctc 3780

agcaatttag tgaaccccag cacacacagt gggtggagac aagtcctagc atgaaaacag 3840

agagaccccc agcatcaacc agtgtggcag cccctatcac cacttctgtt ccctcagtgg 3900

tctctggctt caccaccctg aagaccagct ccacaaaagg gatttggctt gaagaaacat 3960

ctgcagacac actcatcgga gaatccacag ctggcccaac cacccatcag tttgctgttc 4020

ccactgggat ttcaatgaca ggaggcagca gcaccagggg aagccagggc acaacccacc 4080

tactcaccag agccacagca tcatctgaga catccgcaga tttgactctg gccacgaacg 4140

gtgtcccagt ctccgtgtct ccagcagtga gcaagacggc tgctggctca agtcctccag 4200

gagggacaaa gccatcatat acaatggttt cttctgtcat ccctgagaca tcatctctac 4260

agtcctcagc tttcagggaa ggaaccagcc tgggactgac tccattaaac actagacatc 4320

ccttctcttc ccctgaacca gactctgcag gacacaccaa gataagcacc agcattcctc 4380

tgttgtcatc tgcttcagtt cttgaggata aagtgtcagc gaccagcaca ttctcacacc 4440

acaaagccac ctcatctatt accacaggga ctcctgaaat ctcaacaaag acaaagccca 4500


n f +■ f" a rt f fi 


+• <~ +• "I" "t* r" r* +■ r r* 


a uyclw.i.i. Lclcl 


y Cad. L. g Ccig ^ 


aaCaag LCC L 




*rDDU 


gaaatgcaac ttcccctctg actcatccat ctccatcagg ggaagagaca gcagggagtg 4620

tcctcactct cagcacctct gctgagacta cagactcacc taacatccac ccaactggga 4680

cactgacttc agaatcgtca gagagtccta gcactctcag cctcccaagt gtctctggag 4740

tcaaaaccac attttcttca tctactcctt ccactcatct atttactagt ggagaagaaa 4800

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cagaggaaac ttcgaatcca tctgtgtctc aacctgagac ttctgtttcc agagtaagga 4860 

ccaccttggc cagcacctct gtccctaccc cagtattccc caccatggac acctggccta 4920 

cacgttcagc tcagttctct tcatcccacc tagtgagtga gctcagagct acgagcagta 4980 

cctcagttac aaactcaact ggttcagctc ttcctaaaat atctcacctc actgggacgg 5040 

caacaatgtc acagaccaat agagacacgt ttaatgactc tgctgcaccc caaagcacaa 5100 

cttggccaga gactagtccc agattcaaga cagggttacc ttcagcaaca accactgttt 5160 

caacctctgc cacttctctc tctgctactg taatggtctc taaattcact tctccagcaa 5220 

ctagttccat ggaagcaact tctatcaggg aaccatcaac aaccatcctc acaacagaga 5280 

ccacgaatgg cccaggctct atggctgtgg cttctaccaa catcccaatt ggaaagggct 5340 

acattactga aggaagattg gacacaagcc atctgcccat tggaaccaca gcttcctctg 5400 

agacatctat ggattttacc atggccaaag aaagtgtctc aatgtcagta tctccatctc 5460 

agtccatgga tgctgctggc tcaagcactc caggaaggac aagccaattc gttgacacat 5520 

tttctgatga tgtctatcat ttaacatcca gagaaattac aatacctaga gatggaacaa 5580 

gctcagctct gactccacaa atgactgcaa ctcaccctcc atctcctgat cctggctctg 5640 

ctagaagcac ctggcttggc atcttgtcct catctccttc ttctcctact cccaaagtca 5700 

caatgagctc cacattttca actcagagag tcaccacaag catgataatg gacacagttg 5760 

aaactagtcg gtggaacatg cccaacttac cttccacgac ttccttgaca ccaagtaata 5820 

ttccaacaag tggtgccata ggaaaaagca ccctggttcc cttggacact ccatctccag 5880 

ccacatcatt ggaggcatca gaagggggac ttccaaccct cagcacctac cctgaatcaa 5940 

caaacacacc cagcatccac ctcggagcac acgctagttc agaaagtcca agcaccatca 6000 

aacttaccat ggcttcagta gtaaaacctg gctcttacac acctctcacc ttcccctcaa 6060 

tagagaccca cattcatgta tcaacagcca gaatggctta ctcttctggg tcttcacctg 6120 

agatgacagc tcctggagag actaacactg gtagtacctg ggaccccacc acctacatca 6180 

ccactacgga tcctaaggat acaagttcag ctcaggtctc tacaccccac tcagtgagga 6240 

cactcagaac cacagaaaac catccaaaga cagagtccgc caccccagct gcttactctg 6300 

gaagtcctaa aatctcaagt tcacccaatc tcaccagtcc ggccacaaaa gcatggacca 6360 

tcacagacac aactgaacac tccactcaat tacattacac aaaattggca gaaaaatcat 6420 

ctggatttga gacacagtca gctccaggac ctgtctctgt agtaatccct acctccccta 6480 

ccattggaag cagcacattg gaactaactt ctgatgtccc aggggaaccc ctggtccttg 6540 

ctcccagtga gcagaccaca atcactctcc ccatggcaac atggctgagt accagtttga 6600 

cagaggaaat ggcttcaaca gaccttgata tttcaagtcc aagttcaccc atgagtacat 6660 

ttgctatttt tccacctatg tccacacctt ctcatgaact ttcaaagtca gaggcagata 6720

ccagtgccat tagaaataca gattcaacaa cgttggatca gcacctagga atcaggagtt 6780

tgggcagaac tggggactta acaactgttc ctatcacccc actgacaacc acgtggacca 6840

gtgtgattga acactcaaca caagcacagg acaccctttc tgcaacgatg agtcctactc 6900

acgtgacaca gtcactcaaa gatcaaacat ctataccagc ctcagcatcc ccttcccatc 6960

ttactgaagt ctaccctgag ctcgggacac aagggagaag ctcctctgag gcaaccactt 7020

tttggaaacc atctacagac acactgtcca gagagattga gactggccca acaaacattc 7080

aatccactcc acccatggac aacacaacaa cagggagcag tagtagtgga gtcaccctgg 7140

gcatagccca ccttcccata ggaacatcct ccccagctga gacatccaca aacatggcac 7200

tggaaagaag aagttctaca gccactgtct ctatggctgg gacaatggga ctccttgtta 7260

ctagtgctcc aggaagaagc atcagccagt cattaggaag agtttcctct gtcctttctg 7320

agtcaactac tgaaggagtc acagattcta gtaagggaag cagcccaagg ctgaacacac 7380

agggaaatac agctctctcc tcctctcttg aacccagcta tgctgaagga agccagatga 7440

gcacaagcat ccctctaacc tcatctccta caactcctga tgtggaattc atagggggca 7500

gcacattttg gaccaaggag gtcaccacag ttatgacctc agacatctcc aagtcttcag 7560

caaggacaga gtccagctca gctaccctta tgtccacagc tttgggaagc actgaaaata 7620

caggaaaaga aaaactcaga actgcctcta tggatcttcc atctccaact ccatcaatgg 7680

aggtgacacc atggatttct ctcactctca gtaatgcccc caataccaca gattcacttg 7740

acctcagcca tggggtgcac accagctctg cagggacttt ggccactgac aggtcattga 7800

atactggtgt cactagagcc tccagattgg aaaacggctc tgatac '•tct tc':aagtccc 7860

tgtctatggg aaacagcact cacacttcca tgacttacac agagaagagt gaagtgtctt 7920

cttcaatcca tccccgacct gagacctcag ctcctggagc agagaccact ttgacttcca 7980

ctcctggaaa cagggccata agcttaacat tgcctttttc atccattcca gtggaagaag 8040

tcatttctac aggcataacc tcaggaccag acatcaactc agcacccatg acacattctc 8100

ccatcacccc accaacaatt gtatggacca gtacaggcac aattgaacag tccactcaac 8160

cactacatgc agtttcttca gaaaaagttt ctgtgcagac acagtcaact ccatatgtca 8220

actctgtggc agtgtctgct tcccctaccc atgagaattc agtctcttct ggaagcagca 8280

catcctctcc atattcctca gcctcacttg aatccttgga ttccacaatc agtaggagga 8340

atgcaatcac ttcctggcta tgggacctca ctacatctct ccccactaca acttggccaa 8400

gtactagttt atctgaggca ctgtcctcag gccattctgg ggtttcaaac ccaagttcaa 8460

ctacgactga atttccactc ttttcagctg catccacatc tgctgctaag caaagaaatc 8520

cagaaacaga gacccatggt ccccagaata cagccgcgag

cctcggtcac aggtctttct gagactcctg tgggggcaag tatcagctct gaagtccctc 8640

ttccaatggc cataacttct agatcagatg tttctggcct tacatctgag agtactgcta 8700

acccgagttt aggcacagcc tcttcagcag ggaccaaatt aactaggaca atatccctgc 8760

ccacttcaga gtctttggtt tcctttagaa tgaacaagga tccatggaca gtgtcaatcc 8820

ctttggggtc ccatccaact actaatacag aaacaagcat cccagtaaac agcgcaggtc 8880

cacctggctt gtccacagta gcatcagatg taattgacac accttcagat ggggctgaga 8940

gtattcccac tgtctccttt tccccctccc ctgatactga agtgacaact atctcacatt 9000

tcccagaaaa gacaactcat tcatttagaa ccatttcatc tctcactcat gagttgactt 9060

caagagtgac acctattcct ggggattgga tgagttcagc tatgtctaca aagcccacag 9120

gagccagtcc ctccattaca ctgggagaga gaaggacaat cacctctgct gctccaacca 9180

cttcccccat agttctcact gctagtttca cagagaccag cacagtttca ctggataatg 9240

aaactacagt aaaaacctca gatatccttg acgcacggaa aacaaatgag ctcccctcag 9300

atagcagttc ttcttctgat ctgatcaaca cctccatagc ttcttcaact atggatgtca 9360

ctaaaacagc ctccatcagt cccactagca tctcaggaat gacagcaagt tcctccccat 9420

ctctcttctc ttcagataga ccccaggttc ccacatctac aacagagaca aatacagcca 9480

cctctccatc tgtttccagt aacacctatt ctcttgatgg gggctccaat gtgggtggca 9540

ctccatccac tttaccaccc tttacaatca cccaccctgt cgagacaagc tcggccctat 9600

tagcctggtc tagaccagta agaactttca gcaccatggt cagcactgac actgcctccg 9660

gagaiaatcc tacctctagc aattctgtgg tgacttctgt tccagcacca ggtacatgga 9720

ccagtgtagg cagtactact gacttacctg ccatgggctt tctcaagaca agtcctgcag 9780

gagaggcaca ctcacttcta gcatcaacta ttgaaccagc cactgccttc actccccatc 9840

tctcagcagc agtggtcact ggatccagtg ctacatcaga agccagtctt ctcactacga 9900

gtgaaagcaa agccattcat tcttcaccac agaccccaac tacacccacc tctggagcaa 9960

actgggaaac ttcagctact cctgagagcc ttttggtagt cactgagact tcagacacaa 10020

cacttacctc aaagattttg gtcacagata ccatcttgtt ttcaactgtg tccacgccac 10080

cttctaaatt tccaagtacg gggactctgt ctggagcttc cttccctact ttactcccgg 10140

acactccaac catccctctc actgccactg agccaacaag ttcattagct acatcctttg 10200

attccacccc actggtgact atagcttcgg atagtcttgg cacagtccca gagactaccc 10260

tgaccatgtc agagacctca aatggtgatg cactggttct taagacagta agtaacccag 10320

ataggagcat ccctggaatc actatccaag gagtaacaga aagtccactc catccttctt 10380

ccacttcccc ctctaagatt gttgctccac ggaatacaac ctatgaaggt tcgatcacag 10440

tggcactttc tactttgcct gcgggaacta ctggttccct tgtattcagt cagagttctg 10500

aaaactcaga gacaacggct ttggtagact catcagctgg gcttgagagg gcatctgtga 10560

tgccactaac cacaggaagc cagggtatgg ctagctctgg aggaatcaga agtgggtcca 10620

ctcactcaac tggaaccaaa acattttctt ctctccctct gaccatgaac ccaggtgagg 10680

ttacagccat gtctgaaatc accacgaaca gactgacagc tactcaatca acagcaccca 10740

aagggatacc tgtgaagccc accagtgctg agtcaggcct cctaacacct gtctctgcct 10800

cctcaagccc atcaaaggcc tttgcctcac tgactacagc tcccccaact tgggggatcc 10860

cacagtctac cttgacattt gagttttctg aggtcccaag tttggatact aagtccgctt 10920

ctttaccaac tcctggacag tccctgaaca ccattccaga ctcagatgca agcacagcat 10980

cttcctcact gtccaagtct ccagaaaaaa acccaagggc aaggatgatg acttccacaa 11040

aggccataag tgcaagctca tttcaatcaa caggttttac tgaaacccct gagggatctg 11100

cctccccttc tatggcaggg catgaaccca gagtccccac ttcaggaaca ggggacccta 11160

gatatgcctc agagagcatg tcttatccag acccaagcaa ggcatcatca gctatgacat 11220

cgacctctct tgcatcaaaa ctcacaactc tcttcagcac aggtcaagca gcaaggtctg 11280

gttctagttc ctctcccata agcctatcca ctgagaaaga aacaagcttc ctttccccca 11340

ctgcatccac ctccagaaag acttcactat ttcttgggcc ttccatggca aggcagccca 11400

acatattggt gcatcttcag acttcagctc tgacactttc tccaacatcc actctaaata 11460

tgtcccagga ggagcctcct gagttaacct caagccagac cattgcagaa gaagagggaa 11520

caacagctga aacacagacg ttaaccttca caccatctga gaccccaaca tccttgttac 11580

ctgtctcttc tcccacagaa cccacagcca gaagaaagag ttctccagaa acatgggcaa 11640

gctctatttc agttcctgcc aagacctcct tggttgaaag taagaatgcc ctgctccttc 11700

cccaagtgtg ctggggatga atctggaaat aaactacatc ttttttattt tttaaacttt 11760

tatatttgaa aatataaata ttttaggttc agggaacatg tgcaggtttg ttatataggt 11820

aaattgcatg tcatgggggc ttggggtaca gattacatca tcagccaggt aataagccta 11880

gtacctgatc agtagatttt ttttaatcct ctccctcctc ccagcctcca ccctcaattc 11940

acatgtctcc atgtgtactc aaggtttaat tcccacttat gagtgagaac atgcggtatt 12000

tgtaaactac atctttattt ttgctaacct cgaactgaaa tttagcattt gttttattga 12060

tgaatagagg taacaaaaca aaccacatta atcctagcag tgcctgtgcc tttgccaaca 12120

acagaaattc cggacacttt catatcctat gacaattgtt gcaagcactt ttaaaaatca 12180

tgtacgactt tattcataat tatagtggtt attaggcttt tcaatagatc ttatttaatg 12240

agttagtaaa ataagtgcct gtattattgt attacatttg tttattaaga tcttgataac 12300

aacatttcaa tataatcatt tcctttgttt tttaaatttt

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aggtttgtta cgtggatata ctgcataatg atgaggtttg gcttctagtg aacccatcag 12420 

ccaaatagtg aatgttgtgc ccaataagta gtttttcaat cctcacttca ctcccagcct 12480 

cctctatttt ggagtcccag tgtctattat ttctatcttt atgtccacat gtacccattg 12540 

gttagctccc acttataagt gagaatgtgc agtatttaat tttctgtttt tgagttattt 12600 

tgcttaggtt gatggccttc agctccagcc acgttgcttt aaagaacatg atttcattct 12660 

tttttatggc tgcatagtac tccgaggtgt atgtgtacca gattttcttt atccacaatg 12720 

atttcctttg taatctaata ttttatattg ttattttatg ttttattcta tatttttatt 12780 

ttaatttata aaggaattca tatggttcac aagcctgtca aagggaccta taataaaaag 12840 

aggttaagaa tccatgctct aaacagaata ttactccatt ttatttcatt tatttttaaa 12900 

gagacagtct cactctgtca tccaggctgg agtacagtgg agtgatcata gctcattgca 12960 

accctgaact cttgggcaca agcaattctc ctgcttcatc ctccagagga gctgggacta 13020 

caggtgcaca tcaccatgcc cagctagttt taaaaattat tttgtagaga tggtgtctca 13080 

ctatcctacc caggctggtc tcaaactcct gggctcaggc aatcctccca ctttgacctc 13140 

ccaaagtgtt gagattacag gggcaagcca ctgtgcctgg ccacttgtca cattttaatt 13200 

tgtgattact tataaaatga accccttccc atctgagatc tgtcagtctt tctggtgacg 13260 

gtgcctggtg tctgctttct accatgtcct gttagactag tgtttgatgg gaggtcacct 13320 

gggcagctgt ccagctcact cactgggctc tagagcctct gagttgaagc aaaatagaaa 13380 

gatcagtcaa tgtaaagaaa gctcaaaaac tgacattctg aagtaatgga tagctaaacc 13440 

ttcctattgc ccttttcttt cagcaactga tggaacgcta gtgaccacca taaagatgtc 13500 

aagccaggca gcacaaggaa attccacgtg gcctgcccca gcagaggaga cggggaccag 13560 

tccagcaggt aaatatagac cttgtttcca tttctgctct gctaatgcca cccaagcctt 13620 

tcttttcttt tcttttcttt tcttttcttt tcttttcttt tcttttcttt ctctcccttt 13680 

ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctttct ctttctttct 13740 

ttctttcttt ctttctttct ttctttcttt ctttctttct ctttctttct tctttctctc 13800 

tctctctttc tttctttctc ttgttctttt taaatttttt atttttttac ttaatttttt 13860 

tcacccaagc cttaaggcca gtttggacca gatagtgaga ccccacctct ataaaaaaaa 13920 

tttttaaaaa aaaaataagt tgggcatcgt gcaggcctgt agtccctgct actcgagagg 13980 

ccaaggtggg aggacagctt gctgctgact aaaagtgctg cttattgatt ctgggaagaa 14040 

aaaatataca aggcttcagt ttcattattt tataagtaaa tgctagcaac ttttcctttc 14100 

tttctctctt tctctcttcc tctctttctc tcctctcctt ctcttctctc tctctctctc 14160 

tctctctctc tttctctctc ctctccttct cttctcttct ttctctctct ctctctttca 14220

tcattctgtc acccaggctg gagtacagtg gtgtatattt 14280

actgcagtac tcactgtact cactgcagcc tcaaattcct gggctcaagc tatcctctca 14340

cctcagcctc ctgagtagct gggcagcagt ccagctcact cactgggctc tagagcctct 14400

gtgctatgcc cagcttattg ttgttgtttt tttaaatttt tttttttgta cagatggggt 14460

ctcactatgt ggcccaggt ggtcttaaac tcctggctcc aagagatcct cccacctcag 14520

cctcccaaag tgcagggatt acaggtgtga gccactgtgc ccagcctaga cagcattttt 14580

tttttttgaa acagggtctc cctctgttgc ccaggctgga gtgcaatggc gtgttcatgg 14640

ttcactgcag cctcagcctc ctcagtctca agcaatcctc caacttcagc ctcccccaac 14700

agctagaact gcaggtgatc atcaccaatt agcctggtta attgtgtgtg tatttcttaa 14760

attttttgta gagatagttc tcactatatt gcttgggctg gtctcaaact cctggactca 14820

agtgattcac ctacctcggc ctccctaagc actgggatta caggcttgag ccaccacacc 14880

cggcaaggac taggttttaa aataggttcc taggctgggt gtggtggctt acgcccgtaa 14940

tcccagcact ttQaaaaact gaggtgggcg gatcacgagg tcaggagttt gagaccagcc 15000

tggccaacat agtgaaaccc tgtctctact aaaaatacaa aaaattagct gggcatagtg 15060

gcacacacct gtaatcccag ctactcggga ggctgaggaa ggagaatcac ttgaacctgg 15120

gaggcggagg ttacaataaa ccgagatcat gccattgctc tccagcctgg gtgacagagc 15180

aagactccat ctaaaaaaaa aaaaaaaagt tcctttgact tcttgacact cttctctgag 15240

gatattgatc atttttcccc aatagatgtt actaattgaa cacttctgtt gcttcaactt 15300

actaatttac atgatcaata gccaattaat tcagcaggag agaatgctac agagtcgatt 15360

ctttctgtac tttcttctgc tccagagtga aggatctttc taaatcagag accatcactg 15420

tgttcacagg gagggcctag gtgaacctga gatggcaaat gttgcgtttg ttctacggaa 15480

gaagggatta tgggttgaag tccttggcag tgccaaattg cttagaaaaa tgtgaaatat 15540

ggtccctagg agtgctcttg ggatgtcaca tttttctcac tcctttgaca ggtagatgtt 15600

attttcctga aggccaggga aaggattcag agggaggaat gaatttgaaa gaaaatgaag 15660

gtgacgagaa agaatgagct catctccctt atcctctttc ttctcaaatc cttaagtagc 15720

tttgcagtga actaagattt gggggaacct agaggaggct gaaagttgga agctgaaatt 15780

ggcttagcaa gggcaagctc caaagacaaa agtggaaata gtttgggggt agccttttgc 15840

atgggtgaaa tcctggttca tcacatcctc ccttatgcaa agagcccttt tatatggggc 15900

atggggaaaa actgagctaa aggtgataat ttctcctgag caagccagat ggtcaaagct 15960

ctaacttcac catctccctt ggaatgttta atgtgttccc tggtgtccag aggcttaacg 16020

tgtgagaatt aaaagctcaa cattttcttt cccagggaag gaggaaatag ttttaattga 16080

aatcccggga ggaaatgaat gatagtgtca aaccaaaaaa cttcatcttc tgtaccactt 16140

gcatatactc cactgactta ctttctaatc acaggcacat ccccaggaag cccagaaatg 16200

tctaccactc tcaaaatcat gagctccaag gaacccggca tcagcccaga gatcaggtcc 16260

actgtgagaa attctccttg gaagactcca gaaacaactg ttcccatgga gaccacagtg 16320



cccacaggaa ccacatcacc aaccaagtca ccaacagaaa atatgttggc tacagaaagg 16440 

gtctccctct ccccatcccc acctgaggct tggaccaacc tttattctgg aactccagga 16500 

gggaccaggc agtcactggc cacaatgtcc tctgtctccc tagagtcacc aactgctaga 16560 

agcatcacag ggactggtca gcaaagcagt ccagaactgg ttttaaagac aactggaatg 16620 

gaattctcta tgtggcatgg ctctactgga gggaccacag gggacacaca tgtctctctg 16680 

agcacatctt ccaatatcct tgaagaccct gtaaccagcc caaactctgt gagctcattg 16740 

acagataaat ccaaacataa aaccgagaca tgggtcagca ccacagccat tccctccact 16800 

gtcctgaata ataagataat ggcagctgaa caacagacaa gtcgatctgt ggatgaggct 16860 

tattcatcaa ctagttcttg gtcagatcag acatctggga gtgacatcac ccttggtgca 16920 

tctcctgatg tcacaaacac attatacatc acctccacag cacaaaccac ctcactagta 16980 

tctctgccct ctggagacca aggcattaca agcctcacca atccctcagg aggaaaaaca 17040 

agctctgcat catctgtcac atctccttca atagggcttg agactctgat ggccaatgta 17100 

agtgcagtga caagtgacat tgcccctact gctgggcatc tatctcagac ttcatctcct 17160 

gcggaagtga gcatcctgga cataaccaca gctcctactc caggtatctc caccaccatc 17220 

accaccatgg gaaccaactc aatctcaact accacaccca acccagaagt gggtatgagt 17280 

accatggaca gcaccccggc cacagagagg cacacaactt ctacagaaca cccttccacc 17340 

tggtcttcca cagctgcatc agattcctgg actgtcacag acatgacttc aaacttgaaa 17400 

gttgcaagat ctcctggaac aatttccaca atgcatacaa cttcattctt agcctcaagc 17460 

actgaattag actccatgtc tactccccat ggccgtataa ctgtcattgg aaccagcctg 17520 

gtcactccat cctctgatgc ttcagctgta aagacagaga ccagtacaag tgaaagaaca 17580 

ttgagtcctt cagacacaac tgcatctact cccatctcaa ctttttctcg tgtccagagg 17640 

atgagcatct cagttcctga cattttaagt acaagttgga ctcccagtag tacagaagca 17700 

gaagatgtgc ctgtttcaat ggtttctaca gatcatgcta gtacaaagac tgacccaaat 17760 

atgcccctgt ccacttttct gtttgattct ctgtccactc ttgactggga cactgggaga 17820 

tctctgtcat cagccacagc cactacctca gctcctcagg gggccacaac tccccaagaa 17880 

ctcactttgg aaaccatgat cagcccagct acctcacagt tgcccttctc tatagggcac 17940 

attacaagtg cagtcatacc agctgcaatg gcaaggagct ctggagttac tttttcaaga 18000 



gaaccagtca cccttcagtc cacagcccta ggaagtggca gcaccagcat ctctcacctg 
ccagatccca caagcaaaaa ggcagagcag acttccactc agcttcccac caccacttct 18060

gcacatccag agcaggtgcc cagatcagca gcaacaactc tggatgtgat cccacacaca 18120

gcaaaaactc cagatgcaac ttttcagaga caagggcaga cagctcttac aacagaggca 18180

agagctacat ctgactcctg gaatgagaaa gaaaaatcaa ccccaagtgc accttggatc 18240

actgagatga tgaattctgt ctcagaagat accatcaagg aggttaccag ctcctccagt 18300

gtgttaagga ccctgaatac gctggacata aacttggaat ctgggacgac ttcatcccca 18360

cgttggaaaa gcagcccata tgagagaatt gccccttctg agtctaccac agacaaagag 18420

gcaattcacc cttctacaaa cacagtagag accactggct gggtcacaag ttccgaacat 18480

gcttctcatt ccactatccc agcccactca gcgtcatcca aactcacatc tccagtggtt 18540

acaacctcca ccagggaaca agcaatagtt tctatgtcaa caaccacatg gccagagtct 18600

acaagggcta gaacagagcc taattccttc ttgactattg aactgaggga cgtcagccct 18660

tacatggaca ccagctcaac cacacaaaca agttttatct cttccccagg ttccactgcg 18720

atcaccaagg ggcctagaac agaaattacc tcctctaaga gaatatccag ctcattcctt 18780

gcccagtcta tgaggtcgtc agacagcccc tcagaagcca tctccaggct gtctaacttt 18840

cctgccatga cagaatctgg aggaatgatc cttgctatgc aaacaagtcc acctggcgct 18900

acatcactaa gtgcacctac tttggataca tcagccacag cctcctggac agggactcca 18960

ctggctacga ctcagagatt tacatactca gagaagacca ctctctttag caaaggtcct 19020

gaggatacat cacagccaag ccctccctct gtggaagaaa ccagctcttc ctcttccctg 19080

gtacctatca atgctacaac ctcgccttcc aatattttgt tgacatcaca agggcacagt 19140

ccctcctcta ctccacctgt gacctcagtt ttcttgtctg agacctctgg cctggggaag 19200

accacagaca tgtcgaggat aagcttggaa cctggcacaa gtttacctcc caatttgagc 19260

agtacagcag gtgaggcgtt atccacttat gaagcctcca gagatacaaa ggcaattcat 19320

cattctgcag acacagcagt gacgaatatg gaggcaacca gttctgaata ttctcctatc 19380

ccaggccata caaagccatc caaagccaca tctccattgg ttacctccca catcatgggg 19440

gacatcactt cttccacatc agtatttggc tcctccgaga ccacagagat tgagacagtg 19500

tcctctgtga accagggact tcaggagaga agcacatccc aggtggccag ctctgctaca 19560

gagacaagca ctgtcattac ccatgtgtct agtggtgatg ctactactca tgtcaccaag 19620

acacaagcca ctttctctag cggaacatcc atctcaagcc ctcatcagtt tataacttct 19680

accaacacat ttacagatgt gagcaccaac ccctccacct ctctgataat gacagaatct 19740

tcaggagtga ccatcaccac ccaaacaggt cctactggag ctgcaacaca gggtccatat 19800

ctcttggaca catcaaccat gccttacttg acagagactc cattagctgt gactccagat 19860

tttatgcaat cagagaagac cactctcata agcaaaggtc ccaaggatgt gtcctggaca 19920

agccctccct ctgtggcaga aaccagctat ccctcttccc tgacaccttt cttggtcaca 19980

accatacctc ctgccacttc cacgttacaa gggcaacata catcctctcc tgtttctgcg 20040 

acttcagttc ttacctctgg actggtgaag accacagata tgttgaacac aagcatggaa 20100 

cctgtgacca attcacctca aaatttgaac aatccatcaa atgagatact ggccactttg 20160 

gcagccacca cagatataga gactattcat ccttccataa acaaagcagt gaccaatatg 20220 

gggactgcca gttcagcaca tgtactgcat tccactctcc cagtcagctc agaaccatct 20280 

acagccacat ctccaatggt tcctgcctcc agcatggggg acgctcttgc ttctatatca 20340 

atacctggtt ctgagaccac agacattgag ggagagccaa catcctccct gactgctgga 20400 

cgaaaagaga acagcaccct ccaggagatg aactcaacta cagagtcaaa catcatcctc 20460 

tccaatgtgt ctgtgggggc tattactgaa gccacaaaaa tggaagtccc ctcttttgat 20520 

gcaacattca taccaactcc tgctcagtca acaaagttcc cagatatttt ctcagtagcc 20580 

agcagtagac tttcaaactc tcctcccatg acaatatcta cccacatgac caccacccag 20640 

acagggtctt ctggagctac atcaaagatt ccacttgcct tagacacatc aaccttggaa 20700 

acctcagcag ggactccatc agtggtgact gaggggtttg cccactcaaa aataaccact 20760 

gcaatgaaca atgatgtcaa ggacgtgtca cagacaaacc ctccctttca ggatgaagcc 20820 

agctctccct cttctcaagc acctgtcctt gtcacaacct taccttcttc tgttgctttc 20880 

acaccgcaat ggcacagtac ctcctctcct gtttctatgt cctcagttct tacttcttca 20940 

ctggtaaaga ccgcaggcaa ggtggataca agcttagaaa cagtgaccag ttcacctcaa 21000 

agtatgagca acactttgga tgacatatcg gtcacttcag cagccaccac agatatagag 21060

acaacgcatc cttccataaa cacagtagtt accaatgtgg ggaccaccgg ttcagcattt 21120 

gaatcacatt ctactgtctc agcttaccca gagccatcta aagtcacatc tccaaatgtt 21180 

accacctcca ccatggaaga caccacaatt tccagatcaa tacctaaatc ctctaagact 21240 

acaagaactg agactgagac aacttcctcc ctgactccta aactgaggga gaccagcgtc 21300 

tcccaggaga tcacctcgtc cacagagaca agcactgttc cttacaaaga gctcactggt 21360 

gccactaccg aggtatccag gacagatgtc acttcctcta gcagtacatc cttccctggc 21420 

cctgatcagt ccacagtgtc actagacatc tccacagaaa ccaacaccag gctgtctacc 21480 

tccccaataa tgacagaatc tgcagaaata accatcacca cccaaacagg tcctcatggg 21540 

gctacatcac aggatacttt taccatggac ccatcaaata caacccccca ggcagggatc 21600 

cactcagcta tgactcatgg attttcacaa ttggatgtga ccactcttat gagcagaatt 21660 

ccacaggatg tatcatggac aagtcctccc tctgtggata aaaccagctc cccctcttcc 21720 

tttctgccct cacctgcaat gaccacacct tccctgattt cttctacctt accagaggat 21780 
ttcacttctc acctctggcc tagtgaagat tacagacata 21840

ttacgtacac gcttggaacc tgtgaccagc tcacttccaa atttcagcag cacctcagat 21900

aagatactgg ccacttctaa agacagtaaa gacacaaagg aaatttttcc ttctataaac 21960

acagaagaga ccaatgtgaa agccaacaac tctggacatg aatcccattc ccctgcactg 22020

gctgactcag agacacccaa agccacaact caaatggtta tcaccaccac tgtgggagat 22080

ccagctcctt ccacatcaat gccagtgcat ggttcctctg agactacaaa cattaagaga 22140

gagccaacat atttcttgac tcctagactg agagagacca gtacctctca ggagtccagc 22200

tttcccacgg acacaagttt tctactttcc aaagtcccca ctggtactat tactgaggtc 22260

tccagtacag gggtcatctc ttctagcaaa atttccaccc cagaccatga taagtccaca 22320

gtgccacctg acaccttcac aggagagatc cccagggtct tcacctcctc tattaagaca 22380

aaatctgcag aaatgacgat caccacccaa gcaagtcctc ctgagtctgc atcgcacagt 22440

acccttccct tggacacatc aaccacactt tcccagggag ggactcattc aactgtgact 22500

cagggattcc catactcaga ggtgaccact ctcatgggca tgggtcctgg gaatgtgtca 22560

tggatgacaa ctccccctgt ggaagaaacc agctctgtgt cttccctgat gtcttcacct 22620

gccatgacat ccccttctcc tgtttcctcc acatcaccac agagcatccc ctcctctcct 22680

cttcctgtga ctgcacttcc tacttctgtt ctggtgacaa ccacagatgt gttgggcaca 22740

acaagcccag agtctgtaac cagttcacct ccaaatttga gcagcatcac tcatgagaga 22800

ccggccactt acaaagacac tgcacacaca gaagccgcca tgcatcattc cacaaacacc 22860

gcagtgacca atgtagggac ttccgggtct ggacataaat cacaatcctc tgtcctagct 22920

gactcagaga catcgaaagc cacacctctg atgagtacca cctcca cct gggggacaca 22980

agtgtttcca catcaactcc taatatctct cagactaacc aaattcaaac agagccaaca 23040

gcatccctga gccctagact gagggagagc agcacgtctg agaagaccag ctcaacaaca 23100

gagacaaata ctgccttttc ttatgtgccc acaggtgcta ttactcaggc ctccagaaca 23160

gaaatctcct ctagcagaac atccatctca gaccttgatc ggtccacaat agcacccgac 23220

atctccacag gaatgatcac caggctcttc acctccccca tcatgacaaa atctgcagaa 23280

atgaccgtca ccactcaaac aactactcct ggggctacat cacagggtat ccttccctgg 23340

gacacatcaa ccacactttt ccagggaggg actcattcaa ccgtgtctca gggattccca 23400

cactcagaga taaccactct tcggagcaga acccctggag atgtgtcatg gatgacaact 23460

ccccctgtgg aagaaaccag ctctgggttt tccctgatgt caccttccat gacatcccct 23520

tctcctgttt cctccacatc accagagagc atcccctcct ctcctctccc tgtgactgca 23580

cttcttactt ctgttctggt gacaaccaca aatgtattgg gcacaacaag cccagagccc 23640

gtaacgagtt cacctccaaa tttaagcagc cccacacagg agagactgac

gacactgcgc acacagaagc catgcatgct tccatgcata caaacactgc agtggccaac 23760

gtggggacct ccatttctgg acatgaatca caatcttctg tcccagctga ttcagacaca 23820 

tccaaagcca catctccaat gggtaccacc ttcgccatgg gggatacaag tgtttctaca 23880 

tcaactcctg ccttctttga gactagaatt cagactgaat caacatcctc tttgattcct 23940 

ggattaaggg acaccaggac gtctgaggag atcaacactg tgacagagac cagcactgtc 24000 

ctttcagaag tgcccactac tactactact gaggtctcca ggacagaagt tatcacttcc 24060 

agcagaacaa ccatctcagg gcctgatcat tccaaaatgt caccctacat ctccacagaa 24120 

accatcacca ggctctccac ttttcctttt gtaacaggat ccacagaaat ggccatcacc 24180 

aaccaaacag gtcctatagg gactatctca caggctaccc ttaccctgga cacatcaagc 24240 

acagcttcct gggaagggac tcactcacct gtgactcaga gatttccaca ctcagaggag 24300 

accactacta tgagcagaag tactaagggc gtgtcatggc aaagccctcc ctctgtggaa 24360 

gaaaccagtt ctccttcttc cccagtgcct ttacctgcaa taacctcaca ttcatctctt 24420 

tattccgcag tatcaggaag tagccccact tctgctctcc ctgtgacttc ccttctcacc 24480 

tctggcagga ggaagaccat agacatgttg gacacacact cagaacttgt gaccagctcc 24540 

ttaccaagtg caagtagctt ctcaggtgag atactcactt ctgaagcctc cacaaataca 24600 

gagacaattc acttttcaga gaacacagca gaaaccaata tggggaccac caattctatg 24660 

cataaactac attcctctgt ctcaatccac tcccagccat ccggacacac acctccaaag 24720 

gttactggat ctatgatgga ggacgctatt gtttccacat caacacctgg ttctcctgag 24780 

actaaaaatg ttgacagaga ctcaacatcc cctctgactc ctgaactgaa agaggacagc 24840 

accgccctgg tgatgaactc aactacagag tcaaacactg ttttctccag tgtgtccctg 24900 

gatgctgcta ctgaggtctc cagggcagaa gtcacctact atgatcctac attcatgcca 24960 

gcttctgctc agtcaacaaa gtccccagac atttcacctg aagccagcag cagtcattct 25020 

aactctcctc ccttgacaat atctacacac aagaccatcg ccacacaaac aggtccttct 25080 

ggggtgacat ctcttggcca actgaccctg gacacatcaa ccatagccac ctcagcagga 25140 

actccatcag ccagaactca ggattttgta gattcagaaa caaccagtgt catgaacaat 25200 

gatctcaatg atgtgttgaa gacaagccct ttctctgcag aagaagccaa ctctctctct 25260 

tctcaggcac ctctccttgt gacaacctca ccttctcctg taacttccac attgcaagag 25320 

cacagtacct cctctcttgt ttctgtgacc tcagtaccca cccctacact ggcgaagatc 25380 

acagacatgg acacaaactt agaacctgtg actcgttcac ctcaaaattt aaggaacacc 25440 

ttggccactt cagaagccac cacagataca cacacaatgc atccttctat aaacacagca 25500 

gtggccaatg tggggaccac cagttcacca aatgaattct attttactgt ctcacctgac 25560 
tcagacccat ataaagccac atccgcagta gttatcactt ccacctcggg ggactcaata 25620

gtttccacat caatgcctag atcctctgcg atgaaaaaga ttgagtctga gacaactttc 25680 

tccctgatat ttagactgag ggagactagc acctcccaga aaattggctc atcctcagac 25740 

acaagcacgg tctttgacaa agcattcact gctgctacta ctgaggtctc cagaacagaa 25800 

ctcacctcct ctagcagaac atccatccaa ggcactgaaa agcccacaat gtcaccggac 25860 

acctccacaa gatctgtcac catgctttct acttttgctg gcctgacaaa atccgaagaa 25920 

aggaccattg ccacccaaac aggtcctcat agggcgacat cacagggtac ccttacctgg 25980 

gacacatcaa tcacaacctc acaggcaggg acccactcag ctatgactca tggattttca 26040 

caattagatt tgtccactct tacgagtaga gttcctgagt acatatcagg gacaagccca 26100 

ccctctgtgg aaaaaaccag ctcttcctct tcccttctgt ctttaccagc aataacctca 26160 

ccgtcccctg tacctactac attaccagaa agtaggccgt cttctcctgt tcatctgact 26220 

tcactcccca cctctggcct agtgaagacc acagatatgc tggcatctgt ggccagttta 26280 

cctccaaact tgggcagcac ctcacataag ataccgacta cttcagaaga cattaaagat 26340 

acagagaaaa tgtatccttc cacaaacata gcagtaacca atgtggggac caccacttct 26400 

gaaaaggaat cttattcgtc tgtcccagcc tactcagaac cacccaaagt cacctctcca 26460 

atggttacct ctttcaacat aagggacacc attgtttcca catccatgcc tggctcctct 26520 

gagattacaa ggattgagat ggagtcaaca ttctccctgg ctcatgggct gaagggaacc 26580 

agcacctccc aggaccccat cgtatccaca gagaaaagtg ctgtccttca caagttgacc 26640 

actggtgcta ctgagacctc taggacagaa gttgcctctt ctagaagaac atccattcca 26700 

ggccctgatc attccacaga gtcaccagac atctccactg aagtgatccc cagcctgcct 26760 

atctcccttg gcattacaga atcttcaaat atgaccatca tcactcgaac aggtcctcct 26820 

cttggctcta catcacaggg cacatttacc ttggacacac caactacatc ctccagggca 26880 

ggaacacact cgatggcgac tcaggaattt ccacactcag aaatgaccac tgtcatgaac 26940 

aaggaccctg agattctatc atggacaatc cctccttcta tagagaaaac cagcttctcc 27000 

tcttccctga tgccttcacc agccatgact tcacctcctg tttcctcaac attaccaaag 27060 

accattcaca ccactccttc tcctatgacc tcactgctca cccctagcct agtgatgacc 27120 

acagacacat tgggcacaag cccagaacct acaaccagtt cacctccaaa tttgagcagt 27180 

acctcacatg agatactgac aacagatgaa gacaccacag ctatagaagc catgcatcct 27240 

tccacaagca cagcagcgac taatgtggaa accaccagtt ctggacatgg gtcacaatcc 27300 

tctgtcctag ctgactcaga aaaaaccaag gccacagctc caatggatac cacctccacc 27360 

atggggcata caactgtttc cacatcaatg tctgtttcct ctgagactac aaaaattaag 27420 

agagagtcaa catattcctt gactcctgga ctgagagaga ccagcatttc ccaaaatgcc 27480 
agcttttcca ctgacacaag tattgttctt tcagaagtcc ccactggtac tactgctgag 27540

gtctccagga cagaagtcac ctcctctggt agaacatcca tccctggccc ttctcagtcc 27600 

acagttttgc cagaaatatc cacaagaaca atgacaaggc tctttgcctc gcccaccatg 27660 

acagaatcag cagaaatgac catccccact caaacaggtc cttctgggtc tacctcacag 27720 

gataccctta ccttggacac atccaccaca aagtcccagg caaagactca ttcaactttg 27780 

actcagagat ttccacactc agagatgacc actctcatga gcagaggtcc tggagatatg 27840 

tcatggcaaa gctctccctc tctggaaaat cccagctctc tcccttccct gctgtcttta 27900 

cctgccacaa cctcacctcc tcccatttcc tccacattac cagtgactat ctcctcctct 27960 

cctcttcctg tgacttcact tctcacctct agcccggtaa cgaccacaga catgttacac 28020 

acaagcccag aacttgtaac cagttcacct ccaaagctga gccacacttc agatgagaga 28080 

ctgaccactg gcaaggacac cacaaataca gaagctgtgc atccttccac aaacacagca 28140 

gcgtccaatg tggagattcc cagctctgga catgaatccc cttcctctgc cttagctgac 28200 

tcagagacat ccaaagccac atcaccaatg tttattacct ccacccagga ggatacaact 28260 

gttgccatat caacccctca cttcttggag actagcagaa ttcagaaaga gtcaatttcc 28320 

tccctgagcc ctaaattgag ggagacaggc agttctgtgg agacaagctc agccatagag 28380 

acaagtgctg tcctttctga agtgtccgtt ggtgctacta ctgagatctc caggacagaa 28440 

gtcacctcct ctagcagaac atccatctct ggttctgctg agtccacaat gttgccagaa 28500 

atatccacca caagaaaaat cattaagttc cctacttccc ccatcctggc agaatcatca 28560 

gaaatgacca tcaagaccca aacaagtcct cctgggtcta catcagagag tacctttaca 28620 

ttagacacat caaccactcc ctccttggta ataacccatt cgactatgac tcagagattg 28680 

ccacactcag agataaccac tcttgtgagt agaggtgctg gggatgtgcc acggcccagc 28740 

tctctccctg tggaagaaac aagccctcca tcttcccagc tgtctttatc tgccatgatc 28800 

tcaccttctc ctgtttcttc cacattacca gcaagtagcc actcctcttc tgcttctgtg 28860 

acttcacttc tcacaccagg ccaagtgaag actactgagg tgttggacgc aagtgcagaa 28920 

cctgaaacca gttcacctcc aagtttgagc agcacctcag ttgaaatact ggccacctct 28980 

gaagtcacca cagatacgga gaaaattcat cctttctcaa acacggcagt aaccaaagtt 29040 

ggaacttcca gttctggaca tgaatcccct tcctctgtcc tacctgactc agagacaacc 29100 

aaagccacat cggcaatggg taccatctcc attatggggg atacaagtgt ttctacatta 29160 

actcctgcct tatctaacac taggaaaatt cagtcagagc cagcttcctc actgaccacc 29220 

agattgaggg agaccagcac ctctgaagag accagcttag ccacagaagc aaacactgtt 29280 
ctttctaaag tgtccactgg tgctactact gaggtctcca ggacagaagc catctccttt 29340 
agcagaacat ccatgtcagg ccctgagcag tccacaatgt cacaagacat ctccatagga 29400

accatcccca ggatttctgc ctcctctgtc ctgacagaat ctgcaaaaat gaccatcaca 29460

acccaaacag gtccttcgga gtctacacta gaaagtaccc ttaatttgaa cacagcaacc 29520

acaccctctt gggtggaaac ccactctata gtaattcagg gatttccaca cccagagatg 29580

accacttcca tgggcagagg tcctggaggt gtgtcatggc ctagccctcc ctttgtgaaa 29640

gaaaccagcc ctccatcctc cccgctgtct ttacctgccg tgacctcacc tcatcctgtt 29700

tccaccacat tcctagcaca tatccccccc tctccccttc ctgtgacttc acttctcacc 29760

tctggcccgg cgacaaccac agatatcttg ggtacaagca cagaacctgg aaccagttca 29820

tcttcaagtt tgagcaccac ctcccatgag agactgacca cttacaaaga cactgcacat 29880

acagaagccg tgcatccttc cacaaacaca ggagggacca atgtggcaac caccagctct 29940

ggatataaat cacagtcctc tgtcctagct gactcatctc caatgtgtac cacctccacc 30000

atgggggata caagtgttct cacatcaact cctgccttcc ttgagactag gaggattcag 30060

acagagctag cttcctccct gacccctgga ttgagggagt ccagtggctc tgaagggacc 30120

agctcaggca ccaagatgag cactgtcctc tctaaagtgc ccactggtgc tactactgag 30180

atctccaagg aagacgtcac ctccatccca ggtcccgctc aatccacaat atcaccagac 30240

atctccacaa gaaccgtcag ctggttctct acatcccctg tcatgacaga atcagcagaa 30300

ataaccatga acacccatac aagtccttta ggggccacaa cacaaggcac cagtactttg 30360

gccacgtcaa gcacaacctc tttgacaatg acacactcaa ctatatctca aggattttca 30420

cactcacaga tgagcactct tatgaggagg ggtcctgagg atgtatcatg gatgagccct 30480

ccccttctgg aaaaaactag accttccttt tctctgatgt cttcaccagc cacaacttca 30540

ccttctcctg tttcctccac attaccagag agcatctctt cctctcctct tcctgtgact 30600

tcactcctca cgtctggctt ggcaaaaact acagatatgt tgcacaaaag ctcagaacct 30660

gtaaccaact cacctgcaaa tttgagcagc acctcagttg aaatactggc cacctctgaa 30720

gtcaccacag atacagagaa aactcatcct tcttcaaaca gaacagtgac cgatgtgggg 30780

acctccagtt ctggacatga atccacttcc tttgtcctag ctgactcaca gacatccaaa 30840

gtcacatctc caatggttat tacctccacc atggaggata cgagtgtctc cacatcaact 30900

cctggctttt ttgagactag cagaattcag acagaaccaa catcctccct gacccttgga 30960

ctgagaaaga ccagcagctc tgaggggacc agcttagcca cagagatgag cactgtcctt 31020

tctggagtgc ccactggtgc cactgctgaa gtctccagga cagaagtcac ctcctctagc 31080

agaacatcca tctcaggctt tgctcagctc acagtgtcac cagagacttc cacagaaacc 31140

atcaccagac tccctacctc cagcataatg acagaatcag cagaaatgat gatcaagaca 31200

caaacagatc ctcctgggtc tacaccagag agtactcata ctgtggacat
cccaactggg tagaaaccca ctcgactgtg actcagagat tttcacactc agagatgacc 31320

actcttgtga gcagaagccc tggtgatatg ttatggccta gtcaatcctc tgtggaagaa 31380

accagctctg cctcttccct gctgtctctg cctgccacga cctcaccttc tcctgtttcc 31440

tctacattag tagaggattt cccttccgct tctcttcctg tgacttctct tctcacccct 31500

ggcctggtga taaccacaga caggatgggc ataagcagag aacctggaac cagttccact 31560

tcaaatttga gcagcacctc ccatgagaga ctgaccactt tggaagacac tgtagataca 31620

gaagacatgc agccttccac acacacagca gtgaccaacg tgaggacctc catttctgga 31680

catgaatcac aatcttctgt cctatctgac tcagagacac ccaaagccac atctccaatg 31740

ggtaccacct acaccatggg ggaaacgagt gtttccatat ccacttctga cttctttgag 31800

accagcagaa ttcagataga accaacatcc tccctgactt ctggattgag ggagaccagc 31860

agctctgaga ggatcagctc agccacagag ggaagcactg tcctttctga agtgcccagt 31920

ggtgctacca ctgaggtctc caggacagaa gtgatatcct ctaggggaac atccatgtca 31980

gggcctgatc agttcaccat atcaccagac atctctactg aagcgatcac caggctttct 32040

acttccccca ttatgacaga atcagcagaa agtgccatca ctattgagac aggttctcct 32100

ggggctacat cagagggtac cctcaccttg gacacctcaa caacaacctt ttggtcaggg 32160

acccactcaa ctgcatctcc aggattttca cactcagaga tgaccactct tatgagtaga 32220

actcctggag atgtgccatg gccgagcctt ccctctgtgg aagaagccag ctctgtctct 32280

tcctcactgt cttcacctgc catgacctca acttcttttt tctccacatt accagagagc 32340

atctcctcct ctcctcatcc tgtgactgca cttctcaccc ttggcccagt gaagaccaca 32400

gacatgttgc gcacaagctc agaacctgaa accagttcac ctccaaattt gagcagcacc 32460

tcagctgaaa tattagccac gtctgaagtc accaaagata gagagaaaat tcatccctcc 32520

tcaaacacac ctgtagtcaa tgtagggact gtgatttata aacatctatc cccttcctct 32580

gttttggctg acttagtgac aacaaaaccc acatctccaa tggctaccac ctccactctg 32640

gggaatacaa gtgtttccac atcaactcct gccttcccag aaactatgat gacacagcca 32700

acttcctccc tgacttctgg attaagggag atcagtacct ctcaagagac cagctcagca 32760

acagagagaa gtgcttctct ttctggaatg cccactggtg ctactactaa ggtctccaga 32820

acagaagccc tctccttagg cagaacatcc accccaggtc ctgctcaatc cacaatatca 32880

ccagaaatct ccacggaaac catcactaga atttctactc ccctcaccac gacaggatca 32940

gcagaaatga ccatcacccc caaaacaggt cattctgggg catcctcaca aggtaccttt 33000

accttggaca catcaagcag agcctcctgg ccaggaactc actcagctgc aactcacaga 33060

tctccacact cagggatgac cactcctatg agcagaggtc ctgaggatgt gtcatggcca 33120
cagtggaaaa aactagccct ccatcttccc tggtgtcttt atctgcagta 33180

acctcacctt cgccacttta ttccacacca tctgagagta gccactcatc tcctctccgg 33240

gtgacttctc ttttcacccc tgtcatgatg aagaccacag acatgttgga cacaagcttg 33300

gaacctgtga ccacttcacc tcccagtatg aatatcacct cagatgagag tctggccact 33360

tctaaagcca ccatggagac agaggcaatt cagctttcag aaaacacagc tgtgactcag 33420

atgggcacca tcagcgctag acaagaattc tattcctctt atccaggcct cccagagcca 33480

tccaaagtga catctccagt ggtcacctct tccaccataa aagacattgt ttctacaacc 33540

atacctgctt cctctgagat aacaagaatt gagatggagt caacatccac cctgaccccc 33600

acaccaaggg agaccagcac ctcccaggag atccactcag ccacaaagcc aagcactgtt 33660

ccttacaagg cactcactag tgccacgatt gaggactcca tgacacaagt catgtcctct 33720

agcagaggac ctagccctga tcagtccaca atgtcacaag acatatccag tgaagtgatc 33780

accaggctct ctacctcccc catcaaggca gaatctacag aaatgaccat taccacccaa 33840

acaggttctc ctggggctac atcaaggggt acccttacct tggacacttc aacaactttt 33900

atgtcaggga cccactcaac tgcatctcaa ggattttcac actcacagat gaccgctctt 33960

atgagtagaa ctcctggaga tgtgccatgg ctaagccatc cctctgtgga agaagccagc 34020

tctgcctctt tctcactgtc ttcacctgtc atgacctcat cttctcccgt ttcttccaca 34080

ttaccagaca gcatccactc ttcttcgctt cctgtgacat cacttctcac ctcagggctg 34140

gtgaagacca cagagctgtt gggcacaagc tcagaacctg aaaccagttc acccccaaat 34200

ttgagcagca cctcagctga aatactggcc accactgaag tcactacaga tacagagaaa 34260

ctggagatga ccaatgtggt aacctcaggt tatacacatg aatctccttc ctctgtccta 34320

gctgactcag tgacaacaaa ggccacatct tcaatgggta tcacctaccc cacaggagat 34380

acaaatgttc tcacatcaac ccctgccttc tctgacaccn nnnnnnnnnn nnnnnnnnnn 34440

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 34500

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnncggaaa ccaagtttct aaccaacccc 34560

tcctttttga ccccagtagg attcaaacaa agtcaaagct ctcactgact cctgggttga 34620

tggagaccag catctctgaa gagaccagct ctgccacaga aaaaagcact gtcctttcta 34680

gtgtgcccac tggtgctact actgaggtct ccaggacaga agccatctct tctagcagaa 34740

catccatccc aggccctgct caatccacaa tgtcatcaga cacctccatg gaaaccatca 34800

ctagaatttc tacccccctc acaaggaaag aatcaacaga catggccatc acccccaaaa 34860

caggtccttc tggggctacc tcgcagggta cctttacctt ggactcatca agcacagcct 34920

cctggccagg aactcactca gctacaactc agagatttcc acagtcagtg gtgacaactc 34980

ctatgagcag aggtcctgag gatgtgtcat ggccaagccc gctgtctgtg gaaaaaaaca 35040

gccctccatc ttccctggta tcttcatctt cagtaacctc accttcgcca ctttattcca 35100

caccatctgg gagtagccac tcctctcctg tccctgtcac ttctcttttc acctctatca 35160 

tgatgaaggc cacagacatg ttggatgcaa gtttggaacc tgagaccact tcagctccca 35220

atatgaatat cacctcagat gagagtctgg ccacttctaa agccaccacg gagacagagg 35280 

caattcacgt ttttgaaaat acagcagcgt cccatgtgga aaccaccagt gctacagagg 35340 

aactctattc ctcttcccca ggcttctcag agccaacaaa agtgatatct ccagtggtca 35400 

cctcttcctc tataagagac aacatggttt ccacaacaat gcctggctcc tctggcatta 35460 

caaggattga gatagagtca atgtcatctc tgacccctgg actgagggag accagaacct 35520 

cccaggacat cacctcatcc acagagacaa gcactgtcct ttacaagatg tcctctggtg 35580 

ccactcctga ggtctccagg acagaagtta tgccctctag cagaacatcc attcctggcc 35640 

ctgctcagtc cacaatgtca ctagacatct ccgatgaagt tgtcaccagg ctgtctacct 35700 

ctcccatcat gacagaatct gcagaaataa ccatcaccac ccaaacaggt tattctctgg 35760 

ctacatccca ggttaccctt cccttgggca cctcaatgac ctttttgtca gggacccact 35820 

caactatgtc tcaaggactt tcacactcag agatgaccaa tcttatgagc aggggtcctg 35880 

aaagtctgtc atggacgagc cctcgctttg tggaaacaac tagatcttcc tcttctctga 35940 

catcattacc tctcacgacc tcactttctc ctgtgtcctc cacattacta gacagtagcc 36000 

cctcctctcc tcttcctgtg acttcactta tcctcccagg cctggtgaag actacagaag 36060 

tgttggatac aagctcagag cctaaaacca gttcatctcc aaatttgagc agcacctcag 36120 

ttgaaatacc ggccacctct gaaatcatga cagatacaga gaaaattcat ccttcctcaa 36180 

acacagcggt ggccaaagtg aggacctcca gttctgttca tgaatctcat tcctctgtcc 36240 

tagctgactc agaaacaacc ataaccatac cttcaatggg tatcacctcc gctgtggacg 36300 

ataccactgt tttcacatca aatcctgcct tctctgagac taggaggatt ccgacagagc 36360 

caacattctc attgactcct ggattcaggg agactagcac ctctgaagag accacctcaa 36420 

tcacagaaac aagtgcagtc ctttatggag tgcccactag tgctactact gaagtctcca 36480 

tgacagaaat catgtcctct aatagaacac acatccctga ctctgatcag tccacgatgt 36540 

ctccagacat catcactgaa gtgatcacca ggctctcttc ctcatccatg atgtcagaat 36600 

caacacaaat gaccatcacc acccaaaaaa gttctcctgg ggctacagca cagagtactc 36660 

ttaccttggc cacaacaaca gcccccttgg caaggaccca ctcaactgtt cctcctagat 36720 

ttttacactc agagatgaca actcttatga gtaggagtcc tgaaaatcca tcatggaaga 36780 

gctctccctt tgtggaaaaa actagctctt catcttctct gttgtcctta cctgtcacga 36840 

cctcaccttc tgtttcttcc acattaccgc agagtatccc ttcctcctct ttttctgtga 36900 
cttcactcct caccccaggc atggtgaaga ctacagacac aagcacagaa cctggaacca 36960

gtttatctcc aaatctgagt ggcacctcag ttgaaatact ggctgcctct gaagtcacca 37020

cagatacaga gaaaattcat ccttcttcaa gcatggcagt gaccaatgtg ggaaccacca 37080

gttctggaca tgaactatat tcctctgttt caatccactc ggagccatcc aaggctacat 37140

acccagtggg tactccctct tccatggctg aaacctctat ttccacatca atgcctgcta 37200

attttgagac cacaggattt gaggctgagc cattttctca tttgacttct ggatttagga 37260

agacaaacat gtccctggac accagctcag tcacaccaac aaatacacct tcttctcctg 37320

ggtccactca ccttttacag agttccaaga ctgatttcac ctcttctgca aaaacatcat 37380

ccccagactg gcctccagcc tcacagtata ctgaaattcc agtggacata atcaccccct 37440

ttaatgcttc tccatctatt acggagtcca ctgggataac ctccttccca gaatccaggt 37500

ttactatgtc tgtaacagaa agtactcatc atctgagtac agatttgctg ccttcagctg 37560

agactatttc cactggcaca gtgatgcctt ctctatcaga ggccatgact tcatttgcca 37620

ccactggagt tccacgagcc atctcaggtt caggtagtcc attctctagg acagagtcag 37680

gccctgggga tgctactctg tccaccattg cagagagcct gccttcatcc actcctgtgc 37740

cattctcctc ttcaaccttc actaccactg attcttcaac catcccagcc ctccatgaga 37800

taacttcctc ttcagctacc ccatatagag tggacaccag tcttgggaca gagagcagca 37860

ctactgaagg acgcttggtt atggtcagta ctttggacac ttcaagccaa ccaggcagga 37920

catcttcaac acccattttg gataccagaa tgacagagag cgttgagctg ggaacagtga 37980

caagtgctta tcaagttcct tcactctcaa cacggttgac aagagaatgc gcatggcgag 38040

aagggagaag tgtagttgga tggataaaag gaagaatgga gagaagcntg aatugaagga 38100

agcaaagatg aagcggagga aggatagatg cacagaagga aggatgaaaa gaaagaaaga 38160

tgatggaaga caggattgaa ggggatatag attgaaggaa agaaaggtag aaggatgaaa 38220

tgaagtaaag attgaagaaa agatggatgg aaagaagaaa ggagggtgca caaaaaatct 38280

cacacttcac cacatatgat tcatccatat aagaaaaaac cacttgtacc ctcaaagcta 38340

ttgaaataca aacttttaaa ttaaaatttt aaaaagcaag agaaaggaaa gaagggagga 38400

aagacaaaag gaagaatggg tgatagaagg aaagaataaa aggaagaaaa aatggaagaa 38460

tagatgatca gatctaggga tgaatgaaag gaaggatgga caaatctata ggtaggtgga 38520

tggatctatg gacaggtgtg gccacttatg gcacatagtc ccagctccag ttcatactga 38580

tggacttgag gagtgtttgt ggccaatgaa gtggatccat ttagacagtg ctcttcttct 38640

gaatgagatg agttacccca gtttttctcc ccaccttcat cttcaggaac tgatggcatt 38700

atggaacaca tcacaaaaat acccaatgaa gcagcacaca gaggtaccat aagaccagtc 38760

aaaggccctc agacatccac ttcgcctgcc agtcctaaag gtaggtttaa ctttgcttac 38820

ctcccagtaa tgccactcgt gaccatattt cctcctccag agagacaaaa tgtttgtatt 38880

ctttagagag agaattgtgt gtggttgtca taggtttccc tgtctgaact gagtctttat 38940 

ctaatggtta ccaggcagat gttaccactg tctctttctc ctcatggcat gctgagtgag 39000 

ttttgtccaa catcaaatat tcacaaattt gtccatatta accaaatttt aaaaatgctc 39060 

attaaaaact tactatgagc tgggcgcagt ggctcatgcc tgtaatccca atactttggg 39120 

aggctgagct gggtggatca ccagaggtca aaaattcgag accagtctga ccaaaatggt 39180 

gaaactccat ctctactgaa aatataaaaa ttagccgggc atggtggcac acaccgtaat 39240 

cacagctact caggaggctg aggcaagaga gtcacttgaa ccacaggagg tagaggctgc 39300 

agtgagctga gcattgtgcc aatgcactcc agcctgggtg gcagagcaag actccagctc 39360 

agaaataaat aatatattat atatatatat atatgtttta tttagatgga atatactata 39420 

tatatatgta tatatatatg tatgtatata tatatatgta tgtatatata tatatatata 39480 

tatatatata tatatagaga gagagagaga gagagagaga gagagagaca gagtatgtct 39540 

gagaatgcat cccgatagtt ctagcaaggt aggaaaagga agtatcataa cagccttgaa 39600 

gtagcctgtt gaaacagaca gactctcttg taagagaact cacaaaatct aggattatat 39660 

ctcccatgat gaaaaatttg gaactgtaca tttttgttta actgtcactt aaatnnnnnn 39720 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nccaggaggc 39780 

actgtgcttg gcgccttttt accaacactt tgagatggcc attgtactta tccccacttt 39840 

atagacggga aaatggaggt ccagcaatat tttttaactt aaagagccac ccatctcttt 39900 

agagaaagag ccagaatccc aggcaggggc tatcttattc cagagcccaa gctctcaaac 39960 

acatg-taca caatacttaa tctctctcaa gtcagaggag atccacttaa gtatacatcc 40020 

atccacatat tcattcattc aatcattcaa caaatattag ttgagcactt accgtatgcc 40080 

aaacagtcaa acgtgaatag ctgttacaaa tgagactgtg aaggatggta caacgcagat 40140 

tcagacagtg tgataaggaa atattgagaa gcaaagatga gttctggagt gaatttgtaa 40200 

aggtggatgt gggcttggat ttcaataatg gcagaactta aggaatctga tgagaagtgg 40260 

gcacttcagg cagagagaag agcttgaaca aggctcagag gctgacagtg caggaaacac 40320 

atgggaagag ggaatagagt agcggtcaag aattcacaga ggagttatag gtgaagatgc 40380 

aaccaagtta cagaccaagg taagataggg gaataccaat cacaatctct tttcccattc 40440 

cagaagcatc ccagacacat cctagtaacc gagagacatt tctctccctt tcctcctgtg 40500 

gagaataaat aagctattgc aagtccagta agtgtaatca ttttgttcaa attgtgtgcc 40560 

cattccccaa tttacaggac tacacacagg agggacaaaa agaatggaga ccacaaccac 40620 

agctctgaag accaccacca cagctctgaa gaccacttcc agagccacct tgaccaccag 40680 
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tgtctatact 


cccactttgg 


gaacactgac 


tcccctcaat 


gcatcaatgc 


aaatggccag 


40740 


cacaatcccc acagaaatga tgatcacaac cccatatgtt ttccctgatg ttccagaaac 40800
gacatcctca ttggctacca gcctgggagc agaaaccagc acagctcttc ccaggacaac 40860
cccatctgtt ttcaatagag aatcagagac cacagcctca ctggtctctc gttctggggc 40920
agagagaagt ccggttattc aaactctaga tgtttcttct agtgagccag atacaacagc 40980
ttcatgggtt atccatcctg cagagaccat cccaactgtt tccaagacaa cccccaattt 41040
tttccacagt gaattagaca ctgtatcttc cacagccacc agtcatgggg cagacgtcag 41100
ctcagccatt ccaacaaata tctcacctag tgaactagat gcactgaccc cactggtcac 41160
tatttcgggg acagatacta gtacaacatt cccaacactg actaagtccc cacatgaaac 41220
agagacaaga accacatggc tcactcatcc tgcagagacc agctcaacta ttcccagaac 41280
aatccccaat ttttctcatc atgaatcaga tgccacacct tcaatagcca ccagtcctgg 41340
ggcagaaacc agttcagcta ttccaattat gactgtctca cctggtgcag aagatctggt 41400
gacctcacag gtcactagtt ctgggacaga cagaaatatg actattccaa ctttgactct 41460
ttctcctggt gaaccaaaga cgatagcctc attagtcacc catcctgaag cacagacaag 41520
ttcggccatt ccaacttcaa ctatctcgcc tgctgtatca cggttggtga cctcaatggt 41580
caccagtttg gcggcaaaga caagtacaac taatcgagct ctgacaaact cccctggtga 41640
accagctaca acagtttcat tggtcacgca tcctgcacag accagcccaa cagttccctg 41700
gacaacttcc atttttttcc atagtaaatc agacaccaca ccttcaatga ccaccagtca 41760
tggggcagaa tccagttcag ctgttccaac tccaactgtt tcaactgagg taccaggagt 41820
agtgacccct ttggtcacca gttctagggc agtgatcagt acaactattc caattctgac 41880
tctttctcct ggtgaaccag agaccacacc ttcaatggcc accagtcatg gggaagaagc 41940
cagttctgct attccaactc caactgtttc acctggggta ccaggagtgg tgacctctct 42000
ggtcactagt tctagggcag tgactagtac aactattcca attctgactt tttctcttgg 42060
tgaaccagag accacacctt caatggccac cagtcatggg acagaagctg gctcagctgt 42120
tccaactgtt ttacctgagg taccaggaat ggtgacctct ctggttgcta gttctagggc 42180
agtaaccagt acaactcttc caactctgac tctttctcct ggtgaaccag agaccacacc 42240
ttcaatggcc accagtcatg gggcagaagc cagctcaact gttccaactg tttcacctga 42300
ggtaccagga gtggtgacct ctctggtcac tagttctagt ggagtaaaca gtacaagtat 42360
tccaactctg attctttctc ctggtgaact agaaaccaca ccttcaatgg ccaccagtca 42420
tggggcagaa gccagctcag ctgttccaac tccaactgtt tcacctgggg tatcaggagt 42480
ggtgacccct ctggtcacta gttccagggc agtgaccagt acaactattc caattctaac 42540
tctttcttct agtgagccag agaccacacc ttcaatggcc accagtcatg gggtagaagc 42600



cagctcagct gttctaactg tttcacctga ggtaccagga atggtgacct ctctggtcac 42660 
tagttctaga gcagtaacca gtacaactat tccaactctg actatttctt ctgatgaacc 42720 
agagaccaca acttcattgg tcacccattc tgaggcaaag atgatttcag ccattccaac 42780 
tttagctgtc tcccctactg tacaagggct ggtgacttca ctggtcacta gttctgggtc 42840 
agagaccagt gcgttttcaa atctaactgt tgcctcaagt caaccagaga ccatagactc 42900 
atgggtcgct catcctggga cagaagcaag ttctgttgtt ccaactttga ctgtctccac 42960 
tggtgagccg tttacaaata tctcattggt cacccatcct gcagagagta gctcaactct 43020 
tcccaggaca acctcaaggt tttcccacag tgaattagac actatgcctt ctacagtcac 43080 
cagtcctgag gcagaatcca gctcagccat ttcaacaact atttcacctg gtataccagg 43140 
tgtgctgaca tcactggtca ctagctctgg gagagacatc agtgcaactt ttccaacagt 43200 
gcctgagtcc ccacatgaat cagaggcaac agcctcatgg gttactcatc ctgcagtcac 43260 
cagcacaaca gttcccagga caacccctaa ttattctcat agtgaaccag acaccacacc 43320 
atcaatagcc accagtcctg gggcagaagc cacttcagat tttccaacaa taactgtctc 43380 
acctgatgta ccagatatgg taacctcaca ggtcactagt tctgggacag acaccagtat 43440 
aactattcca actctgactc tttcttctgg tgagccagag accacaacct catttatcac 43500 
ctattctgag acacacacaa gttcagccat tccaactctc cctgtctccc ctggtgcatc 43560 
aaagatgctg acctcactgg tcatcagttc tgggacagac agcactacaa ctttcccaac 43620 
actgacggag accccatatg aaccagagac aacagccata cagctcattc atcctgcaga 43680 
gaccaacaca atggttccca ggacaactcc caagttttcc catagtaagt cagacaccac 43740 
actcccagta gccatcacca gtcctgggcc agaagccagt tcagctgttt caacgacaac 43800 
tatctcacct gatatgtcag atctggtgac ctcactggtc cctagttctg ggacagacac 43860 
cagtacaacc ttcccaacat tgagtgagac cccatatgaa ccagagacta cagccacgtg 43920 
gctcactcat cctgcagaaa ccagcacaac ggtttctggg acaattccca acttttccca 43980 
taggggatca gacactgcac cctcaatggt caccagtcct ggagtagaca cgaggtcagg 44040 
tgttccaact acaaccatcc cacccagtat accaggggta gtgacctcac aggtcactag 44100 
ttctgcaaca gacactagta cagctattcc aactttgact ccttctcctg gtgaaccaga 44160 
gaccacagcc tcatcagcta cccatcctgg gacacagact ggcttcactg ttccaattcg 44220 
gactgttccc tctagtgagc cagatacaat ggcttcctgg gtcactcatc ctccacagac 44280 
cagcacacct gtttccagaa caacctccag tttttcccat agtagtccag atgccacacc 44340 
tgtaatggcc accagtccta ggacagaagc cagttcagct gtactgacaa caatctcacc 44400 
tggtgcacca gagatggtga cttcacagat cactagttct ggggcagcaa ccagtacaac 44460 
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ctcctggtat gccagagacc acagccttat tgagcaccca 


44520 


tcccagaaca gagacaagta aaacatttcc tgcttcaact gtgtttcctc aagtatcaga 44580
gaccacagcc tcactcacca ttagacctgg tgcagagact agcacagctc tcccaactca 44640
gacaacatcc tctctcttca ccctacttgt aactggaacc agcagagttg atctaagtcc 44700
aactgcttca cctggtgttt ctgcaaaaac agccccactt tccacccatc cagggacaga 44760
aaccagcaca atgattccaa cttcaactct ttcccttggt ttactagaga ctacaggctt 44820
actggccacc agctcttcag cagagaccag cacgagtact ctaactctga ctgtttcccc 44880
tgctgtctct gggctttcca gtgcctctat aacaactgat aagccccaaa ctgtgacctc 44940
ctggaacaca gaaacctcac catctgtaac ttcagttgga cccccagaat tttccaggac 45000
tgtcacaggc accactatga ccttgatacc atcagagatg ccaacaccac ctaaaaccag 45060
tcatggagaa ggagtgagtc caaccactat cttgagaact acaatggttg aagccactaa 45120
tttagctacc acaggttcca gtcccactgt ggccaagaca acaaccacct tcaatacact 45180
ggctggaagc ctctttactc ctctgaccac acctgggatg tccaccttgg cctctgagag 45240
tgtgacctca agaacaagta agaataactt ttttattgtg gtaaaatata aatactataa 45300
aaattgccat tctaaacatt ttaattgtac aactcagcag tactaataca ttcacattgt 45360
tgtgcaaccc tcaccactat ctgttttcaa aacttttttt atcaccccaa acaggactga 45420
aggaataatt tcccattccc cattctccct agtgcagtgg tgcaatctcg gctcaccaca 45480
acctctgaac ctctgtctcc tgggttcaag caattctcct gcatcagcct cctgagtagt 45540
tgggactaca ggtgcacgcc accgtgcctg gctaattttt gtatttttag tacagacagg 45600
gttttaccat gttggtcagg ctggtctcaa actcctgacc tcaggtggtc cacacgcctt 45660
ggcctcccaa agtgctggga ttacaagtgt gagacactgt gcccggccat atctgttaga 45720
tcttactaat cctgtcaaga ggattcagtg tccttttttt tttttctttc ttttttttga 45780
tagagtctcc ctctggcacc caggctggag tgcagtggta cggtcttggc tcactgcagc 45840
ctccacctcc cagactgaag cgattctcct gcctcagcct cccgaatagc tgggactaca 45900
ggcgcgtgcc accacgccca gctaattttt gcatttttag tagagatggg atttcactat 45960
gttggccagg ctggtctcaa actcctgatc tcaagtgatc cgcccaaggg cctcccaaag 46020
tactgggatt acaggtagga gccacctcac ctggccctat tttcggaatg gatttttttt 46080
taatgtttaa aatgtcacct aagattattg tgaagatcaa ataagataaa atcctaataa 46140
cccaagtaaa ccacagggct ccacttggac cagtctcaga agtttcaaga aaatcagtca 46200
gaccatcaaa tgtaaaataa gtctaaattt tctttgcact attcacagag tgccaaagag 46260
gatctaattc atgtttcaga acatacccta cttactaaaa tccccttttc ctcatttctt 46320


ctcattctgc 


aactttatca 


tctcctgcgg 


accccctagc ctctcccctc 
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cccatagtca 
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gtctctctct 


ctctctttcc 


ctcccctctt 


attatctcaa 


tttcacacga 


aagaattcca 


46440 


gaaactatac tgccaaaagt ctttcctgtc tttgaaaagt tgggaaagag gagaaactca 46500
aacagcaatg acaaaattat acgtaatgga tgaaggaaac acaaataagg ctggaaacag 46560
aaaattttgt ccccatcatt tatttaatga ^ggtggcagt attccagcca catagtgaac 46620
ccccacaata agaaggggcc tctggcgatt gattattgtc attgttgtta atgataatga 46680
aQataaciqat atcatgagca tcagtgtagg aggcagttaa ctaataagac caagctgttg 46740




ctggttcaca 


cctgcagtcc 


cagcactttg 


ggaggccaaa 


gtgggtggat 


46800 


cacttaagat caggagttca agactagcct ggccaacatg gtgaaacctg gtctctacca 46860
aaaatacaaa aattagtcag gtgtggtggc gtgtgcctgt aatgacaact acttgggagg 46920
ctaaaacaaa agaatcactt gaacctggga ggcggaggct gcagtgagat gagcttgaac 46980
cactacactc caacccaagc aacagagaga gactcttgtc tcaaaaaaca aaacaaacaa 47040
acaaaaacta aaccaaacaa aaaaagacta gctgttattc atttatttat ttatttattt 47100
aaaaacggag tctcgctctg tcacccaggc tggagtgcag cggcacaatc ttggctcact 47160
gcaacctctg cctcccaggt tcatgtgatt ctcccgcctc agcctcccca gctgttgtta 47220
ttcatgaatg aacctcagag aaagcacaca ggagggttgg tgcacctgtg ttttgagttc 47280
tacccctcct tcctctctta acttcctcct gtcttctcac tctgattcgt tcttccttcc 47340
tcticcctctc tctctacaoa ttataaccat cggtcctgga tctccaccac cagcagtgag 47400
taaacataac cctgaagtcc ctatgccctg ggaattcttc ctccctaagc ctgccttcca 47460
aaaaaaaaat atcccccatt ccctaggttc tcatccccac agaaactcca gaatagcaaa 47520
aatictcaaac tgagccaagg cacagatgcc agtgctcacc aagagtccta ttctcccctc 47580
gctaaatgat aggacccaac aaacccgatt cacgctgcgt tttctttcag ctccgatgac 47640
ctccatattc tctccaaggc ctctcgtatc tgtgagcccc acccccagcg ctacaggtag 47700
aaatctaact tccagctccc atgaaacgtc ggctgccatt cagtggctga ttaattgctg 47760
tataatctaa gtcctgatgc ccaccaagtc tcagcgtgtt cccctctgtc caatctcatc 47820
caacaattta agctaatgct tgtttaatga tgtcctcact ataccacctt ggacactttc 47880
tttttgcctg gatttaaagc ttccatttct ttccttcctt ccttcttttc ttccttcctt 47940










k wVb>« k www 


tccttccttc 


48000 


cttcctttct tcctttcttc ctgtcttttt ctttctttcc ttcttttggc agagtctcac 48060
tctgtcgccc aggctggagt gcaatggtgc aatctcggtt cactgcaacc tctgcctccc 48120
aggttcaagc gattctcatg ccacatgcca ctatgcctgg ctaatttttg tttttttgtt 48180
ttttgggggg ttttttgaga cagagtctca gtctgttgcc caagctggag tgcagtggca 48240
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tgatctcggg 


tcactgcaac 


ctccttctcc 


caggttcaag 


cgattttcct 


gcctcagcct 


48300 


cctgagtagc tggaactaca ggcacgcacc atcacaccgg ctaatttttt gtgtttttag 48360
tagagacgac ggttttgcaa tgtgggccag gcttgtctcg aactcctgac ctcaagtgat 48420
cctccagcct cggcctctca aagtgctggg attacaagtg tgagccactg caccaggcca 48480
aaaacttgta tttcaatagt cattgaggct gggtgcagtg gctcacgcct gtaatcccag 48540
cactttggga ggctgaggcc agtggatcat gaggtcagga gatcaagacc accctggcta 48600
acacagtgaa accccatctc tactaaaaat acacacaaaa attagccggg catggtggca 48660
agatgcctgt agtcccagct actcaggagg ctgaggcagg agaatggcgt gaacctggga 48720
ggcagagctt gcagtgagcg gagatcgcac cgctgcactc cagcctgggc aacagagagc 48780
gactctgtct caaaaaaaaa aatatatata tatatatata tatattcatt gagaccgact 48840
ctgacttaaa agcagtaatg aatggtgtag gttttggtaa attacaggtc ttgctttaag 48900
tcctggtcct ctcttttgct cactgtgtgg ccccggaaga gccatgtaac ctctccaggc 48960
ttcagtgtcc atttttagaa cggagtaagt gaataagctg tgtccaatca tctctggcca 49020
tatcagcttc attttttttt tcctccaggg tccaaacatc cctccaccct cagagtcttt 49080
gcacctggtg ttcttgtcct tcaaatctca gcttggatca ccctttataa agtagcattt 49140
cccccgtata cgcatcttgc acacagccaa tctctattct acctctatgc tcacttcctt 49200
cctggcaatt attactacag ctgggccctt gaacagcatg agggttcagg gtgctgaccc 49260
ctatgcattc aaaaatccac atataacttt tttttttttg agatggagtt tcacacttgt 49320
tgcccaggct ggagtgcagt ggcgccatct tggctcactg caaactctgc ctcctgggtt 49380
caagtgattc tcctgcctca gcctcctgag tagctgggat tacaggcatg tgccaccatg 49440
cccagctaat tttgtatttt tagtagagat gaggtttctc catgttcgcc aggctgctct 49500
tgaactcctg acttcaggtg atccgcctgc cttggcctcc caaagtgctg ggattacagg 49560
catgagccat gatgcccggc catttgctaa tggcatctag taagtagagg ccagagatgt 49620
tgcaaaacat ccaacaatgc acaaagcagc ctcctatcaa aacacattat ccagaccaaa 49680
atgtcaatag ggctgaggtt gagcatctgc tgtacacaga ttccaagttc tggtacaaat 49740
ctcgtagttc tctgagggct catctttcaa tgcctagcac atcaaaggag gccaatttcc 49800
tcttcccttt cacctcctgg tatgaaatgt ttcctcctcc accttgatcc tgtaagagcc 49860


cagcuggagti 


Lxgcagacga 


cggggaaaga 


aaugggxgag 


ggagggtccx 


atggttgagt 




ctccgcagtg ggccctgggt gcccagttca ccctcctccc cttcattttc tccatcatga 49980
caactcaagg caaattctca gtttccatgg gccagtggaa tccactgact tcatgaaata 50040
accccaccct gagcaaatac ccctcaaata ataactgttt acacaacatc agtggcaaca 50100


atgacccaag 


cagcaatgcc 


accaccagaa 


tagcaaccat 
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cattttcatc 


50160 
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aaaaggaaac tgtagggcca ggcacagtgg ctcacaccta tattcccagc attttgggag 50220 
gctgaggcag gcagatcacc tgaggtcagg agttcaagac cagcccagcc aacatggtga 50280 



aaccccatct ctactaaaaa tacaaaaact agccaggctt ggtggcatgt gcctgtaatc 50340

ctagctactc gggaggctga ggcaggagaa ttgcttgaac ctgggaggca gaggttgcag 50400

tgagctgaga ttgtgccact gcactccagc ctgggcgaca gagcaagact ccgtctgaaa 50460

aaaaaaaaaa aaggaattgt gccaggaatt gtgatgagaa ctttatatgc attatctcct 50520

attaatatta cccaaacctc cgtgagttac tatactcatt tctacagaga gcatttatgc 50580

atccagggag gaagtaatta gcccagaatt actcagttat gacacaggac agtatgaaaa 50640

ctccaaccga agattggaga ctcatgaaaa ctccaggctc ctaactacaa gacatcactg 50700

tggatcgtcc aaatagagca agccccaatc tcaggacagg aatgaggcat gaatggcctc 50760

tatgctaatg atctaaccta atgctgaatt tgttacttcc cttctgaatc cacttggaga 50820

tttcctttat atctgacttg aaatagagga tatatactcc tctatccttg acataggaga 50880

taatacacag aaagtatttc attgtagtat caagtacaca tcctgttctg tgtccatagg 50940

attatgacta atttagggca tggcttaaca gtgtggtact attgaatgac agacagatgt 51000

ctgttttgtt ggatgcagga caagccatgt aacctcccca gactttagtg tcccctctgt 51060

ggaatggaat aaaaatacta cgtgggattg ttctgataat caaatgagat aattcaggaa 51120

caacccagat aaataacagg gctgccctgg gttctgtctt tccttgtatc tctcacagag 51180

cctcaaagga gatgcaatcc atgacctaga gaaacactca ggacaaattc tcttttcccc 51240

agttcctttc ttgctccaat ggcaacacca cccctctcat cctgaagtct cttgttttta 51300

ccaccacacc tattttgcca aattttctcc aatattccaa accatatgaa acctttcttt 51360

ctttcttttc tttccttcct ttccttcttt ctttcttttt tctcttcttt tcttttcttt 51420

ttgagacatg gtctcactct gttgcacagg ctggagtgca atggcacgat ctttgctcac 51480

tgcaacctcc gcctcccagg ttcaagagat tctcttgcct cagcctcctg agtagctggg 51540

attacaggcg cccaccgcca cgccacgcta atttttgtgt tcttagtgga gacggggttt 51600

cgccatgttg gccaggctgg tcttgaactc ctgacctcaa gtgatttgcc catctcggtc 51660

tcccaaagtg ctaggattac aggcgtgagc caccaagccc ggccccatat gaaccgtttc 51720

tatccctcat ttctctgtac ttttacctaa aaacaccact cccttcaccc atcacatttt 51780

tgtcaattct acatcacaca cacacacaca cacacacaca cacacacaga gaaagtaagt 51840

tggaaaaaaa ttatactatc atgaaatttt gtgaaaggag gtaagctgag agagtaagaa 51900

tcaaactaaa ttatctttat gggtagaaag cacactcatc catacatgtg tctttccacc 51960

cttgtaatgt atttattatt attgtttgta tatactagat tcccaataaa tagggacagc 52020
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tattatggta 


tttttatttc 
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aggaiataata atagtgatga tttccaccat tattgtcaaa 


52080 


ggacaaagca 


caaaatatgt 




ai.aLayv»wau uaL«_VpULwaL 


Lwa^aaaaya 


52140 


tcttggcccc 


acctcttctc 




\>^aLyai- L Ly LL^aai.LLi.y 


y v.i»a\*x v.xy y 


52200 


gctgagagat 


ggaggttccc 


*^ "t" n f n a n f" n 


aay LLa^awa LL.yaayy Lyy 


aayLLLi. lcLi 


52260 


cctccctctg 


gctggctgag 


fina +*a n f"f"c"a 


ria+rtnrt!^+^a 'l"#"a"l"naaan+ 
yelLyyyvXi..a xcaxyaaayx 


XXvivCaXXaX 


52320 


ttccatttct 


ggatctacca 




CCCXaCCXCX CaCCCaXuaX 


aaxxgxccxx 


52380 


ctttactctt 


tcctccctat 


c-ugccigy L La 


xaaccgxcgg xacxggaccc 


^■^Ai^^a ^#*aM 

cxgccaccag 


52440 


cagtgagtat 


tcaaacctgt 


yaLcLLLCCcla 


xycccxxggg aCCCXXCCXC 


cccaayyxgc 


52500 


attcctcaga 


agagaaactg 




cxcccxacgx gcccagccac 


agccxcagag 


52560 


cagcccctaa 


cccgtcaagg 


^ i" n n 

LCXcygLg Lg 


a /^+" ^ a ^ rt a a a^i ♦* ^^a a a 

ayXCaayaXa yaagXCCaaa 


^■ft*/"^ a a^/1 a#v 

XXCCclaXgag 


52620 


cagttcctgt 


cccatattcc 


"t" -♦- ^ n /I ^ f% «3 

L L Lciggciciya 


CaCCCaaXCa XXXCXCCaXg 


xxcxxxxxxx 


52680 


ctcagctcca 


gtgacttcta 


Ccl L lC lCCCC 


aygyaXXXCC aCaXCCXCCa 


xccccagcxc 


52740 


cacaggtagg 


aagctcctct 


c tyycci lc La 


xydciaxxxcLd Cacxgcaxgg 


xcxgxxcccx 


52800 


gctgaccacc 


cagactcagc 


Cug LUCCaCT 


CyCCCXCXCa CXCXCXCXCX 


CXCXCXXX XL 


52860 


tttttttttt 


tttttttttt 


t L Xacggay L 


cxxQcxcxgx cacccaggcx 


ggagxggaax 


52920 


ggtgtgatct 


cggctcactg 


caciccu ucgc 


cxcccaggxx cacgxgaxxc 


xccxgccxca 


52980 


gcctccggag 


tagctgggat 


tacaggtgca 


caccaccatg cctggctaat 


tttttgtatt 


53040 


tttagtagag 


acggggtttc 


accaLgL xgg 


ccaggcxggx cxxgaacxcc 


xgaccxcaag 


53100 


tgatctaccc 


accttggcct 


cccaaagxyc 


xgggaxtaxa ggcatgagcc 


accacgccag 


53160 


gcccactctc 


taaattttga 


ccacccrgcc 


xxgagxggxc x tc rage ^.cc 


cxaaccLCxg 


53220 


tctaacctcg 


agagctttgc 


aCXaycyaXu 


ccxygggacc agcxaxggxx 


ggxaxcLLCX 


53280 


caactttcta 


atttttttaa 


aaXXaXXaXX 


a+'4'a'l'iha't*'t*a ■t*<t*a+>'t>+>'l>a a a 

axxaxxaxxa xxaxxxxaaa 


+>#v#va M'f" 

xggagxcxcg 


53340 


ctctgtcacc 


caggctggag 


xgcagxggca 


ccaxcxcggc xcaxxgcaac 


^4>^4' ^ ^^4b^^ 

cxcxaccLCC 


53400 


cgggttcatg 


caattttcct 


yi.l.Xv.ayi»C.a 


yaaaxxxxcx Cayxyyxcya 


gaLXyiycca 


53460 


ctgcactcca 


gcctgggcaa 


xyyayviXayy 


^^^^a't'^'t'^a aaaaaaaaaa 
v-XCuaXCXCa aaaaaaaaaa 


s 9 9 9 9n9^n 

aaaaaagaCg 


53520 


gaggtcgggc 


attcctaacc 


a a c* ^+Ti 
V. I. LaaL^C Ly 


LC-xxyxyaxx cxyyayxxax 


n9AI9^9A9 9#* 

yayaLayaaC 


53580 


ctggtgtccc gtaattaaaa ttccgccttc aggccttatg ttttgtgagt cacaacactg 53640
caaacttttt acatgctgta gacaggatgt tcactctcca cttcctcact gctctgctct 53700
aatcaattca accatttatg tgacatgcct aacccctctg ggcttgtacg tatgtaacat 53760
gtattacaaa gcaagtcatt ccatgatcaa tgctgtcact ttttctaggt gctttcaaaa 53820
tttgttcttc atcattgatt ttcagtagtt tgattacgat gtgtctgggc atggttttct 53880


ttgagtttat 


cctgcttaaa 


gtgttctcag 


cttcttgagt ctcaaagtgt 
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ctctgattct 


ttctcccctt 


cggacctcca 


atgaaatgat 


gttgcccgaa 


gagaccctga 


54000 


ggttctgttc attttgttat ttatcaatct tttttcctct ccgaatttca ggtttaataa 54060
tttttttttt ttttttgaga cggagtctcg ctctgtcgcc caggctggag tgcagtggcg 54120
cgatctcggc tcaccgcaag ctccgccccc tgggttcacg ccattctcct gcctcagcct 54180
ccggagtagc tgggattaca ggcacccgcc accatgcccg gctaattttt tgtatttttt 54240
agtagagacg gggtttcacc gtattagcca ggatggtctc aatctcctga cctcgtgatc 54300
cgcccgcctc agcctcctaa agagctggga ttacaggcgt gagccactgc gcccggccca 54360
ggtttaataa tttttataga atattttcac aatcaccaag ccttttctct accagctcca 54420
ttctgcccat ccattgaatt ctttttatct cagttacttt atgtttcagt tcgaaagttt 54480
ctacttggtt agatagatag atgttatatc atatattata tgttatataa aaatatattt 54540
atggttatac atataacata tatgttatat atagttattt atatagccat aactatatat 54600
agccatatat atagttatat ataaccatat atatagttac catatagtaa ccacatatat 54660
aaaacatata tatatagtgt ctctctatat atagttatat atatagtttc tatatctgta 54720
actatatata gttatatatg tatgtttctc tgtatataaa tatatatatt tctatatata 54780
tagttataca cattatatat ataactggga gatgttggta aaggatggcg tgaggaaacc 54840
tggagcagtc atggtaatcc tcgctctgct ccgaactcct caagagcagg agaagggtcc 54900
tcctcattct ccagccatgt tgactttgag caatttactc atcctctcag tacctcagtt 54960
tcctcacctg ccaattgagg ataataatat ttcataaatt gtttgcaaat gttatatgca 55020
actctacgta agaacaccta gcacaggggc taccagggaa tttggtttaa caaatattta 55080
tcaggcacct attctgggct gggcaggggg gataagatgt tgactaagtc aaatgcagtc 55140
cctcccctca ccaagtttac agtgtattgg gcaagactga aatggaacaa gcaattacaa 55200
ttgacaataa aagacaacca agttattgag cacttactat atggcatgcc atatgctatg 55260
tatttttttt atttttaact tttcattttg aaataaataa taaatataaa gtaaataata 55320
atataaataa ataataaata acttttcatt ttgaaataaa taataaataa attcaggaga 55380
tgttgcgaaa atagtgtagc attcccctgt atccttcacc cagtttctcc ccaatggcta 55440
catcttacat aactctaata caatatcaaa agcaggaaac tgacattgtt aaaatccatt 55500
ttactggttt tacacgcgtg tgtgcatatg tgagcttgtg tatgtgcgtg tgtgtgcagg 55560
catgtgtgtg catgcacgcc tgtgtgtgca tatgtgcatg tgtgcatgcg tgtgtgcatg 55620
tgtgcatgtg tgtgtgcatg cgtgcgtgcg tgcgtgcatc tgtgtgcatg tatgcacatg 55680
tgtgtgtgtc tgtgcacgtg tgtgcatgca tgtgtgtgtg cgtgtgtgtt ggtagcccta 55740
tgcaattttt atcacatggg catagcccta taatcaccac caccatcaag attcagaact 55800
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gttccattcc cccaaagatt cccctcatgc tagccttcgt aatcatgccc actgagccca 55860 

acactattgc atagaatagc tattctactc tccatctcca tctctgtctc tacaattttc 55920 

ttttgaagat gttatataaa tggaaatgta caacatgtca cctttgaaat tggcttcttt 55980 

tccactcagt gtaatgccct ggagatgtgc tctttttaac agtcatgtaa ccttcctaat 56040 

ttccctccaa aatatcatta tgcccctcgc cgcctttttt tttttttttt ttttttgaga 56100 

cagagtctcg ctctgttgcc caggctggag tgcagtggta taatctcagc tcactgcagc 56160 

ctccgtctcc cgggttcaag ggattcccct gcctcagcct cccaagtagc caggat'taca 56220 

agtgcatgcc accacgcctg gctaattttt gtatttttag tcgagacggg gtttcattgt 56280 

gttggccagg ctggtctcga attcctgacc tcaagtgatc tgcccgcctt ggcctcccaa 56340 

agtgctggga ttacaggtgt gagccaccgc gcccgaccca tattgcccat tgtattacag 56400 

cggaagaaac tgaggtatgg acaggtaaca tgtccatggt cacttggctg gtgaggggca 56460 

gagaggagat ttgaaaccaa atctgactca ctagtgtggc cgtaaccatg gtaactatgt 56520 

ctctctacca tgtggtctcc tctttattaa aggaagggca agttctggga gttttgggag 56580 

ttttgggctt gagtggggaa gggtagccaa gtaaagcagg tgagagaagg tctgctttaa 56640 

ggactgctgt ttgattttta ttgttgttgt tcagtgttca atgggattga gttgactctt 56700 

ttttcccttc ttgttcccca aagcatgaga ctgttccggt ccttttccct tttaacttct 56760 

cagctagagt ttgttagggc gggtatgggc acctggcaga gtctgagacc tcagcttcca 56820 

gtaggcacac gttctgaccc aatacaccta ccctggtccc ctaacctgct tctggtcccc 56880 

taacctgctt ctgggcccag gtaatgcatt ttaggaacat cccacttttc tccttacctg 56940 

gctttccatt atccgtccaa actaaagcac ccacctgtct gcttcagact cttgcttcaa 57000 

gcactccgtc tgggtcctca gaaattgact tacagtcagt tcagatctga ctcaggcgtg 57060 

gccttctttt ctccttcctt gc 57082 

<210> 2 

<211> 42348 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature
<222> (1)..(130)
<223> Exon R1

<220>
<221> misc_feature
<222> (442)..(510)
<223> Exon R2

<220>
<221> misc_feature
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<222> 
<223> 



(2126) . . (2250) 
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n IS a, c, g, or t 



<220>
<221> misc_feature
<222> (5479)..(5652)
<223> Exon R3

<220>
<221> misc_feature
<222> (6301)..(6334)
<223> Exon R4

<220>
<221> misc_feature
<222> (6593)..(6657)
<223> Exon R5

<220>
<221> misc_feature
<222> (7558)..(7683)
<223> Exon R1

<220>
<221> misc_feature
<222> (8216)..(8284)
<223> Exon R2

<220>
<221> misc_feature
<222> (8877)..(9050)
<223> Exon R3

<220>
<221> misc_feature
<222> (9380)..(9413)
<223> Exon R4

<220>
<221> misc_feature
<222> (9675)..(9739)
<223> Exon R5

<220>
<221> misc_feature
<222> (10082)..(10200)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (10201)..(10291)
<223> Exon R1

<220>
<221> misc_feature
<222> (10524)..(10592)
<223> Exon R2

<220>
<221> misc_feature
<222> (11200)..(11373)
<223> Exon R3
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<221> mi sc_f eatu re 

<222> (11722)..(11755)
<223> Exon R4

<220>
<221> misc_feature
<222> (12016)..(12036)
<223> Exon R5

<220>
<221> misc_feature
<222> (12037)..(12150)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (12169)..(12295)
<223> Exon R1

<220>
<221> misc_feature
<222> (12532)..(12600)
<223> Exon R2

<220>
<221> misc_feature
<222> (13219)..(13392)
<223> Exon R3

<220>
<221> misc_feature
<222> (13723)..(13756)
<223> Exon R4

<220>
<221> misc_feature
<222> (14016)..(14077)
<223> Exon R5

<220>
<221> misc_feature
<222> (15001)..(15126)
<223> Exon R1

<220>
<221> misc_feature
<222> (15367)..(15435)
<223> Exon R2

<220>
<221> misc_feature
<222> (15475)..(15601)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (15648)..(15773)
<223> Exon R1

<220>
<221> misc_feature
<222> (16002)..(16070)
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<223> Exon R2 
<220>
<221> misc_feature
<222> (16653)..(16826)
<223> Exon R3

<220>
<221> misc_feature
<222> (17158)..(17191)
<223> Exon R4

<220>
<221> misc_feature
<222> (17453)..(17517)
<223> Exon R5

<220>
<221> misc_feature
<222> (17866)..(18531)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (18532)..(18657)
<223> Exon R1

<220>
<221> misc_feature
<222> (18658)..(18887)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (18888)..(18956)
<223> Exon R2

<220>
<221> misc_feature
<222> (18957)..(18995)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (19633)..(19806)
<223> Exon R3

<220>
<221> misc_feature
<222> (20093)..(20140)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (20141)..(20176)
<223> Exon R4

<220>
<221> misc_feature
<222> (20177)..(20386)
<223> n is a, c, g, or t

<220>
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<221> misc_feature 

<222> (20387)..(20449)
<223> Exon R5

<220>
<221> misc_feature
<222> (20450)..(20483)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (21609)..(21731)
<223> Exon R1

<220>
<221> misc_feature
<222> (21940)..(22008)
<223> Exon R2

<220>
<221> misc_feature
<222> (22605)..(22778)
<223> Exon R3

<220>
<221> misc_feature
<222> (23109)..(23142)
<223> Exon R4

<220>
<221> misc_feature
<222> (24607)..(24700)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (29046)..(29168)
<223> Exon R1

<220>
<221> misc_feature
<222> (29266)..(29334)
<223> Exon R2

<220>
<221> misc_feature
<222> (30368)..(30450)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (33917)..(34090)
<223> Exon R3

<220>
<221> misc_feature
<222> (35192)..(35259)
<223> n is a, c, g, or t

<220>
<221> misc_feature
<222> (36702)..(36734)
<223> Exon R4
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<220> 

<221> misc_feature
<222> (38270)..(38320)
<223> Exon R5

<220>
<221> misc_feature
<222> (39104)..(39224)
<223> Exon R1

<220>
<221> misc_feature
<222> (39315)..(39383)
<223> Exon R2

<220>
<221> misc_feature
<222> (39532)..(39705)
<223> Exon R3

<220>
<221> misc_feature
<222> (41862)..(41992)
<223> Exon R4



<400> 2 
agcagccaca gtcccattca tggtgccatt caccctcaac ttcaccatca ccaacctgca 60
gtacgaggag gacatgcggc accctggttc caggaagttc aacgccacag agagagaact 120
gcagggtctg gtgagagccc cgcccaccgt actcctccct cgcccactta gacaaaccag 180
cccacctcac actgcctcgc ccactgatgc cagccacgcc cacctcatcc aaccccagac 240
acctttccct gccccaccca ctgattttag ccaagcccac ctcaccccac ccagcctact 300
gatgccagcc acgcccacct ttccctgccc cgcccactga tttcagccac gcccacctca 360
ccctggtcca cccctccaat gccccactct tcctggcttc ccgcagctgt tgtttctcac 420
ctcccctctc cttccttgca gctcaaaccc ttgttcagga atagcagtct ggaatacctc 480
tattcaggct gcagactagc ctcactcagg tgagacgctc cttaagaaaa acacagccca 540
acaggtgaat atgaccctag tctctgggct ccctgactct gttcatactt ggaacaacta 600
ttgcccatgg atactaagca tcaccaccag cagcagcaga taactattcc taagacccaa 660
ggcactgcat tatgtacttt atatttaatg cctcatcagt gcttgcaaca gcctcatgaa 720
gcaggagcag aaggggaaac tgaggcccag attaagtggc ttgtgccagg acacacaaag 780
caactgcagc acttcaggtt ctatatccaa actcctatcc cttaggtggc acttcctcct 840
ctgcccccat tatgaacttg cagcatgtgg aaaaccccaa tctgacttcc ctctaaggga 900
acttgcccag agaatctaag aggggaggaa aggaaggcgt tcagccctta caggcaggag 960
gtcagctcct gagtggctca gatgcagcca cagagggcct ggccggtctg agggtgactg 1020


^gs^ggcaccg 


agggcactgt 


ccctgagtgc 


tggaaagggc 
ctggcattct caccttccac acctctctca cagaatctcc


1140 




cxcccaagag 


agagtgtcag 


tcatctacct 


ccagcttcct ttccttccca 


1200 


99999^^9^9 


gggacagggg 


ggccctagtg 


gctaagagca 


ttggtgaact caggcagacc 


1260 




accaacccag 


ctctgccatt 


tactatctgt 


gactctgagc aagtgcctga 


1320 




gcccxaxxxc 


ctgacatatt 


atatatataa 


aatacatata ttatatatag 


1380 




xaxacaxaxx 


gaggcatatt 


ttataaacat 


gtttatagac acatttttat 


1440 


atgcatatgt 


tatatacgta 


tataacatat 


gttatatata 


atgtatatat tatacatatt 


1500 


gXLataLLy^ 


atacatgtta 


tatatgttat 


agcatatata 


gtacaagtta tatataacac 


1560 


aTacHtratg 


ttacatataa 


tgtatatgtt 


atatatgata 


tattatatat aattatatat 


1620 


caLaxaaaac 


xgxtaxatat 


aattatatat 


aatatatagt 


tgttatatat aattatataa 


1680 
xxaxaxacaa


catataacat 


acattatata 


ttgttatata taatataata 


1740 


xaxacaxaxa 


xaacaxaxgt 


ataactttta 


tgttatacat 


aatgtatata acatatatgt 


1800 


gtatgtgtga 


tgtacataac 


atatctgaca 


ttaacatata 


acatatgata taacaatatt 


1860 


axatgtxaxa 


acataatata 


tgttataata 


taacaatatt 


atatgttata acttatactg 


1920 


tcatatgtaa 


catatacata 


atattttata 


aatcagttta 


atatacatta tgttacatat 


1980 


aatgtatgtt 


atatatgata 


tattatatat 


aattatatta 


tacataattg ttatatataa 


2040 
cgtattatat
ataacatctg tataacttat


2280 


atgtxaxaxa 


taagtatata 


acatatatgt 


gtatgtgatg 


tatataacat atctgacatt 


2340 


aacatataac 


atatgttata 


atatgacata 


ttatatatat 


tacatataac gtatatcatg 


2400 


xaxaaxaxaa 


xgxgtaxata 


taatatatta 


aagtatataa 


gtataaatac atgtaatatt 


2460 


XaaaXaXaXa 


xxaxaxaxag 


tatacatgtg 


gatacataca 


acttctacat atacctagta 


2520 


xaxaxxcxax 


axaxaaacag 


tccatgaatt 


acaatgattc 


aacttatgat ttttcaaact 


2580 


xxgxgaxaax 


gccaxagcaa 


tatgcattca 


gtagaaagca 


taccttcaac acccatgcaa 


2640 


ccaxxcxgxc 


attcacttxc 


agtacaatat 


tcaataaatt 


atatgagata ttcaacagtt 


2700 


tattataaaa 


taggctttgt 


gttaggtgat 


tttgcccaca 


tgtaggctaa tgtaagggtt 


2760 


cagagcatgt 


ttaaggtagg 


ataggctaac 


ctatcatgtt 


ctgtaggtta ggtatagtcg 


2820 


atttttattt 


ttatttttat 


ttttgagaca 


gagtcttgct 


ctgtcaccca gactggaatg 


2880 


cactggtgcg 


atcatagctc 


actgcagcct 


tgaactcctg 


ggctcaagtg atcctcctac 


2940 


ctcagcctcc 


tgagtagctg 


ggactacagg 


tgtgtgccac 
ttaatttttt tttttttgtg gagaggaggg tcttgccatg ttgcccaggt ggccttgaac
tcctgggctc aaggaatcct cccaccttgg cctcccaaaa tcctgggatt acaggtgtga 
gccatcacgc ccggctacag ggcatttttg acttatgaca ttttcagttc acaatggatt 
tgtcagggct gggcatgatg gctcacacct gtcatcccag cactttggga ggctgaggca 
ggtggatcac ttgaggccag gagtttgaga ccaggctgtc caaatggcaa aatcttgtct 
ctactaaaaa tacaaaaatt agccaggcgt ggtgtgacaa ctgtagttcc agctactcgg 
gagactgaag cgtgagaatc acttgaactt aggagatgga agttacagtg agtcaagatc 
acaccaccgc actccagcct ggatgacaga gcaagactct tgtctccaaa aaacaaaaaa 
caggctgggt gcatggctca tgcctgtaat cccagcagtt tgggaagctg aggcaggttt 
atcacctgag gtcagtagtt cacgatcagc ttggcaaaca tggagaaaac ccatctctac 
taaaaataca aaaattagct ggatgtggtg gtgggtacct gtagtcccag ctactcggga 
ggctgaggca ggagaatgga ttgaacctgg gaggcagagg ttgcagtgag ccaagatcac 
accattgaac tccagcctgg gcaacagagt gagactccat ctccaaaaac aaaagaaagc 
aaaaacaaaa aaataaaata aaaaacctgt gtttatcagg acataatacc atcatgagtc 
aagaagcatc taaatgtaca tggtagttat ataaaaatag ttatatagtt atatacaata 
gttatatata aaccagttta atatatgtta agtagaggta tatggtagtt atataaaaaa 
tagttatata atagttatag agttatataa ttatataaaa tagttatata taaaccagtt 
taatatatgt taggtagagg tataataata tatattgtat atactatata atatagtaat 
gtataaaatg caaaacgata tcatatattt ctatattaag tttatattta cagatctaca 
ttttat-tat tttatgttat atacaattgt gttatacata atataattag tatagtactg 
acttggggaa ttgagcagta ccaacccata gggatgtttg aggatgaaaa tatgtgatta 
tgaatacaaa atgctgggcc tgctgcatag gaagtattta ataaatggta gttgttacta 
taaagtcgtt cctactatag agctactcac aacctgggac atagggaaag agcccgtttc 
cctctaatca ctcaatagtg ggtggctagg taggtgagtc cacatcctgt ggccgggaac 
aggtgctgag acatgaagac cttctgactg catgttggac cagccacagt ttcagacgga 
ccagccaaaa agggcatttt ccccaagcca tttagctccc ttgagtctca taacaaatct 
cctagaccct gctggtccat aggatctaga gaggatgact tgaaccttct gatcccacca 
tttgaaaacg ccatgccatg ggcaccagta ggagggccac tgctacgtgc accagtacaa 
gggccactgc catggattac agattaaccc taagtatagc tgtcgcacac ctagtacttc 
aggaggctta ttcggggcca tgcagatccc tggcattatt atcctaggat cctacaccaa 
gcaaagcagg agctgcccct cctcataaac ccataagccc tcctcttgag caaagcagct 
gggaaggcca gaagttattc aagctcccct ctgccccggt tccaaagaca gacagctcaa 4920

gcctacatgc agcaaaccGt ataaaagtgt cacctcttgg catttctgcc atggtaatgc 4980

tttctgcttc cactaataat cctagtaatt tgtttatggt gggcatctct ctgatgagaa 5040

ccacattctt tttttttttt tttttttttt ttgagataga gtctcactct gttgcccaga 5100

ctggagtgca gtggcgcgat ctcggctcac tgtaaccttt ggctcctagg ttcaagcaat 5160

tctcctgcct cagcctccca agtagctggg actgcaggca cgtaccacca tgcccagcta 5220

atttttgtat ttttagttga gacggggttt caccatgtta gccaggatgg tctcaatctc 5280

ttgacctcat gatccacctg ccttggcctc ccaaagtgtt gggattacag gcatgagcca 5340

ccatgcctag cctgagagcc acattcttgt taaccacaat tttctcagag tctgcattag 5400

gggttgacaa agagtggaaa ggaaggacaa aaggatggag aggtggatgg actaagcata 5460

tgtaggttct tacccaggcc agagaaggat agctcagcca cggcagtgga tgccatctgc 5520

acacatcgcc ctgaccctga agacctcgga ctggacagag agcgactgta ctgggagctg 5580

agcaatctga caaatggcat ccaggagctg ggcccctaca ccctggaccg gaacagtctc 5640

tatgtcaatg gtgagcagct gtgatgtggt tggaggctct tcctccttgc tgagcagcct 5700

gtaatcactg gcttgaggtc acactcactg tcaggcaatt gaaaatttgg tcctgtgctc 5760

tacatgggat gactaatttc cggacttcat ggtatctttt tttttttttt tttttttttg 5820

agatggagtc tcgctctgtc accaggctga ggtgcagtgg catgatctca gctcactgca 5880

acctccgcct cccggattca agcaattctc ctgcctcagc ctcctgagta gctgggacta 5940

caggtgcatg ccaccacacc cagctaattt ttgtattttt agtagagaca gggtttcacc 6000

atgttggtca ggatggtctc aatctcttga ccttctactc caccttgcct tggcctccca 6060

aagtactggg attacaggct tgagccacca cacctggcca ggacttcatg gtttcttcat 6120

catcatggaa tgaattccat cagggcattc ttccctgatg tgagggcact gataggaaat 6180

ctttaatggt ccctgctgca tgaaactgct tccattgcac cagggtagcc ctgaccccta 6240

tttggtcccc cacatctcct tgtaacttac ccacactcct ccctccttct ctgtgcaggt 6300

ttcacccatc gaagctctat gcccaccacc agcagtgagt attcaactca tgtccacatg 6360

cccatgatcc tacaccaagc aaagcaggag ctgcccctcc tcataaaccc ataagtcctc 6420

ctcttgagca aagtagctgg gaaggcagaa gttattcaag ctcccctctg ccccagtttc 6480

aaagacagac tcagctcaag cccacatgca gcaaacccta taaaagtctc acctcttggc 6540

atttctgcca tggtaatgct ttctgctctc actaatgagg acttctcctc agctcctggg 6600

acctccacag tggatgtggg aacctcaggg actccatcct ccagccccag ccccacgagt 6660

aagtaccagt caatggcatc tctattagag catgctatct ctgtcatttt tactcagatg 6720

aagatggaaa atcatagcaa atctactgat agtgagtgga ccaacgaaat ttgttggcca 6780
cctagtgtgt accagatcct agagatacag gagggaaaac aaaaccaata caaaatttct
gctctcagtg agcttgtatt cttgtcatga tgatgatgtt ggtggtggtg ctgttgatga 
cgatgatgat gatgatgatg atgatgatgc tggtgatact gttgatggtg atagtgatgt 
tgatgacaat gatgatgatg atgatgttga agaaaatgat gctggtgatg gtggtggggg 
ttattatggt aataatgata tgttgagtgt gacgatgatg gtggtggtgt tgatgatgat 
gatgattatt atgctagtga cattgatgat ggtaatggtg atatcaacga cagtgacaat 
gatggtgatg aggatgatgt cggtgatggt ggtggggtta tgatggtaat gatatgttga 
atgtgatgat ggtgatgatg atatttgtgg ttcatgatgg ggattgtcat ggtggtgctg 
gtggtacttg tgatgacaat aatgataata atgatgacaa tgatagtgat gatggtgatg 
gtgataataa agataacaga tatcacctta caatattgag cactaaatat gtaccaagag 
ctatgctcag tatctaacta ctattatata atctacttta gaaaatgaat tgtatcatag 
ataagaaagg cgtggaaaat atttattatg tcactcaatt taattgctgc atatggttat 
tacaaagtgc tattctctct actttgaaca taatgtttat ttcacactcc cactatagct 
gctggccctc tcctgatgcc gttcaccctc aacttcacca tcaccaacct gcagtacgag 
gaggacatgc gtcgcactgg ctccaggaag ttcaacacca tggagagtgt cctgcagggt 
ctggttagtg tcctgccctc cacactctgc cctgctcatg atacccagtc cctcttacat 
catccatgcc agggcaatgg aagaatatca aacccaactc acttttgccc caagagatgc 
aagcctcagc caggagcggt ggctcacgcc tgtaatacca gcatttggga ggccaaggcg 
ggtggatcac ctgaggtcag gagtttgtga ccagcctggc caacatagtg aaacctcatc 
cctactaaaa tacaaaaatt agccaagcat ggtggtgcat gcctgtaatc ccagctactt 
gggagggtga ggcaagagaa tcacttgaat caaggaggca gaggttgcag tgagtcaaga 
tcatgccact ttactccagc ctaggcaaaa aagcgaaact ccatctcaca aaaaaaagaa 
aaaaagagag agatgcaagc ctcccccacc aaggccagcc ctgcccacct cacttctgcc 
tggctcttac ataaaactta gccctcctac tcactgccct ctccctcctc cacagctcaa 
gcccttgttc aagaacacca gtgttggccc tctgtactct ggctgcagat tgaccttgct 
caggtgagaa cttagaattt ccagcctggc tgccccactt gtactcactc caaaagactt 
tgcactgctt ccttgctgca cttcctaggg atatcctcac caaaggtgga attcaggagt 
cacaggcttc aggatcagtg tgtttcctga cagtaacacc cctacactcc acctcaacag 
agagaatctg catggcccat catcaggatt gagcctctcc ctttatcatc cctctgaatt 
ccctccattc cctgtgcctc cctttccttt acatgttaaa ttctgtcccc aggatttctt 
tcaggacaat catgccttat ccacgtgatt tcatcctcat ttcgagctct tcactgggct 
caagtccggc tccccgtccc gtccatgaaa gtgtcagttt catcttgtca ctgtatccgt 8700

gactccactc acagtcctca gcaagccaat agtccatgca ctaagagtcg atgtggcttc 8760 

tcacctcttt cccaggtttc tcatttctct ggtccttgct gtccttccct cagcaatcgc 8820 

aagacccttc ctagataaac ttttcattgt gatttttccc actgaccctc cccaggcccg 8880 

agaaagatgg ggcagccact ggagtggatg ccatctgcac ccaccgcctt gaccccaaaa 8940 

gccctggact caacagggag cagctgtact gggagctaag caaactgacc aatgacattg 9000 

aagagctggg cccctacacc ctggacagga acagtctcta tgtcaatggt gagtggctgt 9060 

gatgtggttg aaatctcttc ccccttgctg ggcagcctct aatctctaac tagagatcac 9120 

actccctgcc tggcctttga aaattctgtc atgtgctcta catgggatga ctaaggtctg 9180 

gacttcatgg tttccttacc atcatggact gtgttccctc agggcattct ttcctgatgt 9240 

gaggatgctg atagaaaatc ttcaattgtc cctgtaccat gaaactcggt tcattgcacc 9300 

agggtagcat tgacctccat ttggtccccc acctctcctt gtctcttacc cactctcctc 9360 

cctccttctc tatgcaggtt tcacccatca gagctctgtg tccaccacca gcagtgagta 9420 

ttcaactcat atccacatgc ctcggttcct acaccaagag gagcaggagc tggcccctcc 9480 

tcataaaccc attaagtcct cttcataagc aaaggattta ggagggcaga agttatttaa 9540 

gtgtccctct gcccagctca agagaccgac ccagctcaag ctacacatgc aacaaacccc 9600 

ataaatagtc tcccctcttg ccatttctgc caagagagtg ctttatgctt tcactgatga 9660 

gaacttttcc tcagctcctg ggacctccac agtggatctc agaacctcag ggactccatc 9720 

ctccctctcc agccccacaa gtaagtatca gtcaatgaca tctctatgag agcatacctg 9780 

attagtgtaa acatctctgt cattttcact caaataaaga tggaaaatca tagtaaatct 9840 

agtgatactg agtggacaaa tttgtttgtt tgttttttct catccttttc acttttttta 9900 

ttatacttta agttttaggg tacatgtgca caatgtgcag tttagttaca catgtataca 9960 

tgtgccatgc tggtgtgctg cacccatttg ctcgtcattt agcattaagt atatgtccta 10020 

tgcgatccaa gcccacgcgc cgcaccacgt gcaacagttt cacagattgg atggtccgat 10080 

annnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10140 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 10200 

cttcaccatc accaacctgc agtatgagga ggacatgcat cgccctggat ctaggaagtt 10260 

caacaccaca gagagggtcc tgcagggtct ggttagcacc ctgccctctt cactctcccc 10320 

cgccctggat gccgagcccc tcatacaaca ttcatgccag ggcaatggaa gaatatcgca 10380 

ccaaccttgc cctcatcccc agagatgcaa gcctcaccca ctgaggccag ccactctcat 10440 

gggtgtctgc cccacccacc tcacttttgt ccccacacag ggaccttagc cctcctactt 10500 

acctctctct ccctccccca cagcttagtc ccatattcaa gaacaccagt gttggccctc 10560 
tgtactctgg ctgcagactg acctctctca ggtgagacct tagaagatcc agcctggctg
ccccagttgt tcccactcca gtagattttg ctctgcttcc ttgctgcacc tcctagggat 
atcctcacca aaaggggaat tcaggagtca ctggcttctg gaccaatgtg tttcctgata 
gtaacactcc cacacctcac ctcaacaggg agaatctgca tggtccatca tcaggattga 
gcctctatcc tgatcatccc tcagaattcc ctgcccctcc ctttcattta ggtgttaaat 
tctgtcccca gaatttctct caagacaatc atgcctcatc caagtgcttt catccctgtt 
tctagctctt cactggtctc aagtctgggc tctcctgtcc ccatgctatg agaatgcagg 
tttcaccttg cacttttata agcatggttg tatctgtgac tctgtgcaca gtcccaagca 
agccagtagt ccatgcactc agagaatcta agtgtagctt ctcacctctt tcccaggttt 
ctcatttcct ctggttcttt actgtctttc catcagcagt ctcaggacac aacctaagta 
atcttttcat agtcattctc cccacctacc ttccccaggt ctgagaagga tggagcagcc 
actggagtgg atgccatctg catccatcat cttgacccca aaagccctgg actcaacaga 
gagcggctgt actgggagct gagccgactg accaatggca tcaaagagct gggcccctac 
accctggaca ggaacagtct ctatgtcaat ggtgagcagc tgtgatgtgg ttggagtctt 
ttccttctag agtctggaaa gaatctaatc tgtggcttga agtcacactc cctgcctggc 
cattgaatat tctgtcatgt ggtgtagatg ggatgacaaa gttctggact tcacagtttc 
ttcattgtcg tgaactgtgt tccctcaggg cactcttccc tgttgtgagg atactgatag 
gaattcttta atggccccag tcccatgaaa ctcattgtcc catgaaactc atttaattgc 
attgggattg ccatgacctt attgtgtccc tcgtatctcc ttaacgctta ccaagtctcc 
tccctccttc tctatgcagg tttcacccat cggacctctg tgcccaccac cagcagtgag 
tattcaactc atgtccacat gcccctgatc ctacattaag tggagcagga gctggcccct 
cctcttaaac ccataagtcc tcctcttgag caaaggagct gggaaggcag aagttattga 
agctcccttc cacctagctc caaagacagg cccagctcat gcccgtatgc agcagacctc 
ataatagtct accttcttgc catttctgcc atgagattat tttctgcttt cactgatgag 
cactttttct cagctcctgg gacctccaca gtggacnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn attttcaatt cccactacag ctgctggccc 
tctcctggtg ctgttcaccc tcaacttcac catcaccaac ctgaagtatg aggaggacat 
gcatcgccct ggctccagga agttcaacac cactgagagg gtcctgcaga ctctggttag 
tgcccttccc tcctcactct gcccagcccc agatatccag tcccttctac atcatccatg 
ccagggtgat gaaagaagat agcaacaact tccccccttc cccccaagag atgcaagccc 
cacccacaga gaccagtcct gcttattggt gcctgctcca cccacctcac atctgccccg 12480

acacacacac accttagccc cactactcac ctccctctcc ctcctctaca gcttggtcct 12540 

atgttcaaga acaccagtgt tggccttctg tactctggct gcagactgac cttgctcagg 12600 

tgagacttta gaagagccag cctgggtgcc caaacttgtt cccactctaa aagactttgc 12660 

actgcttcct tgctgcactt cctaggtata tcttcaccac aaggggaatt caggagtcat 12720 

tggcttgaga accagttgtt tcctgatagt aacaccccca tgccccaact caacatgcaa 12780 

aatcttcatg gttcatcatc aggattgaga cactaccctg attacccatc tgaattccct 12840 

cctttccctg acccctccct ttcatttagg tgttaaattc tgtccccagg atttctctca 12900 

agataaccat gcctcatcca catacatgca tccgcctttc aagctcatca ctagtctgaa 12960 

gctctgggtt ctcctgttcc catgccatga gaatgcaggt ttcaccttgc acttttataa 13020 

aaattattat atccatgact ctgcttgcag tcccagacca agatagtggt ctatgtactc 13080 

agataatcta agtgcagatt ctcacctctt tcccagattt ctcatttcct ctggttcctt 13140 

gatatgtttc cctcagcaat ctcaagacaa gtcctaggca atcttttcat tgtcattccc 13200 

cctcctacct tcctcaggtc cgagaaggat ggagcagcca ctggagtgga tgccatctgc 13260 

acccaccgtc ttgaccccaa aagccctgga gtggacaggg agcagctata ctgggagctg 13320 

agccagctga ccaatggcat caaagagctg ggcccctaca ccctggacag gaacagtctc 13380 

tatgtcaatg gtgagcagct gtgatatggt aggggtctct tcctcctggc tgtgcaacca 13440 

tctaatctct ggcttggggg cacactccct gcctggccat tgaaaattct gtcacgtgct 13500 

ctacatggga tgactaagtt ctggacttca tggtttcttt gttatcatga gaggcattcc 13560 

ctctgggcac tcttccctgt tgtgaggatg ctgataggaa atctttaatg acccctgtcc 13620 

catgaaactc atttaattgc accagggtag tcctgaactc tatcgcgtcc cccacatctc 13680 

cttaaccctt acccagtctc ctccctcctt ctctatgcag gtttcaccca ttggatccct 13740 

gtgcccacca gcagcagtga gtattcaact catgtccatg atgcccctga tcctacatca 13800 

agtggagcaa gagctggccc ctcctcttta acccataagt cctcctcttg agcaaatgag 13860 

ctgggaaggc agaagttact caagctcccc tctgccccag ctccaaagac agacccagct 13920 

caagcccaca tgcagcagac ctcataatag tctatcttct tgccatttct gccatgagag 13980 

tgctttctgc tttcactgat gaggactttt ttcagctcct gggacctcca cagtggacct 14040 

tgggtcaggg actccatcct ccctccccag ccccacaagt aagtaccagc caatggtatc 14100 

tgtattagat catgcctgat gaatgcaaac atctgtgcca ttttcagtca aatgaaaatg 14160 

gaaaatcata ataaatctag tgatactgag tgaaccaaaa aaaatgtatt ggccacctac 14220 

agtgtaccag accctaggga tatagcaagg aaaatagaac caataaaaac atctctgccc 14280 

tcagtgagct tgtgttcatg tgatgatatg atggtggtgg tggtggtaat agtaataatg 14340 
acatattcag tttgatgata atttatgatt atggtgttgc tgttgatgat ggtggtggtg 14400

atgttactga caatgatggt gacggatctt tgaggatatt gtccgtgatg gtcgtgaaga 14460 

ttatgatgat aatgatgatg tgttaagtgt gatgatgatg atgatctgtg gtgatgctgt 14520 

ttaggatgct gttccgtggt accgatgata ttgatgttgg tcgtggttat gttgtatgac 14580 

aatgacaatg atggtgatga ggataatcgc cagtgatggt gtgggtttat gatgatgatg 14640 

atgtgttgaa tgtggtgatg ataatgttcg tggtggtcgt gatgggcatt actatggcag 14700 

tgatggtcat aataatgatg gtgatggtga caatgatagc aaggatgatg atggcaataa 14760 

agatagtaca taacatcaga caatattgag ctctgaatat gcaccacgag gagtgctcag 14820 

catctaaata ctattatata atatattttt gtaaaaataa attgtattgt tttaggcaag 14880 

ggaagcatgg taaatatttt gtcactcaat ttaaattctg catatgttta aagataagtc 14940 

tattgcaaac tcctattttc tctactttgg acatagtgtt tgtttcccac ctccactaca 15000 

gctgctggcc ctctcctggt gccattcacc ctcaacttca ccatcaccaa cctgcagtac 15060 

gaggaggaca tgcatcaccc aggctccagg aagttcaaca ccacggagcg ggtcctgcag 15120 

ggtctggtta gtgctccacc ctcctcactc cgccccaccc cagagagtca gtacctccta 15180 

catcatccat gccaggtgat ggaacaagat catacccacc tcacccttgc cccaagagat 15240 

gcaagccatg cccattgaaa ccagccccac tcactgatgc ctgttactgc cccacctgac 15300 

ttctgcccta cacacccaca cacgcaactt agccctccta ctcatctcct tctccctcct 15360 

ccacagcttg gtcccatgtt caagaacacc agtgtcggcc ttctgtactc tggctgcaga 15420 

ctgaccttgc tcaggtgaga ccttagaaga tcaagcttgg ctgccccact tgttnnnnnn 15480 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15540 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 15600 

ngtgttagtc tacttttgaa cactgtttat ttcccatctt cactatagcc gccagccctc 15660 

tcctggtgct attcacaatt aacttcacca tcactaacct gcggtatgag gagaacatgc 15720 

atcagcctgg ctctagaaag tttaacacca cggagagagt ccttcagggt ctggtaagag 15780

ccccacatac ctcattctac cgccactcac catgtttagt cctgcccacc tcacctattg 15840 

cagagcatgg aagatctcat ctacctcatc ttgcccccag atatgcatac cccaaccact 15900 

gatgccagcc ccaccaactg ttgccagccc tgcccacctc ccttctacca cacccctatg 15960 

acttcagtcc tcccactcac ctccctctcc ctcctccaca gctcaggcct gtgttcaaga 16020 

acaccagtgt tggccctctg tactctggct gcagactgac cttgctcagg tgagaactga 16080 

gaacagccag tctgactgat ctgagcagtt tgacctgctt cccttctgca ctccctggag 16140 

atgtccgcag ccaggtggaa tccaggaggc agtggctcta agaccaatgt gcttcctgtt 16200 






cccaccacct cccacctcaa ctgagagatg cagagcccat cagcaggact gagcttctac 16260

cttggtcatc cctctgaatt ccctcctttc ccctacctgc ctttccacaa gtggttcaat 16320

tctgttccca ggatttctcc caagaaaaac atgcctcgtc cacttgcttt catccccaaa 16380

cctagctctt cacctgtctc aagtatgagt tctccttacc ccatgctaca agaatgcagt 16440

ttccactttg caattttata aaaatccttg catccatgat tctgctcata gttgctaaga 16500

gtcagtgcac tcagagaatg gaagtatggc ttctcacttc tctaccaggc ttctcatttc 16560

ctctggcccc ctcctgtcct gccctgtggg atctcagaac ccctccctag gcaatccgtg 16620

tattgtcttt ccccaatctt gccctcccca ggcccaagaa ggatggggca gccaccaaag 16680

tggatgccat ctgcacttac cgccctgatc ccaaaagccc tggactggac agagagcagc 16740

tatactggga gctgagccag ctgacccaca gcatcactga gctgggcccc tacacactgg 16800

«icagggacag tctctatgtc aatggtgagt agttgtgatg tggttggagt ctcttcctcc 16860

ttgctgggca gcctctactc tctgccttga ggtcacgctc cctgcctggc tattgaatgc 16920

tcatccatgt tgtctgtatg tgatggctga ggttggaact tcatggtttc tatttcatct 16980

tggactgagt tcatcctcag gatctgcttt ctggatctga gggtgctgat agagaatctt 17040

caatggttcg tgttctggga aattccttcc attgcaccag ggtaccctga cccctatata 17100

gttccccacc actcccttaa cccttaccca ccctcttccc tccctctcta tgcaggtttc 17160

acacagcgga gctctgtgcc caccactagc agtgagtatc cactgatttc cagtgctcct 17220

gatcctacat catgcagggc aagaactgac ccctcctcac atgcccctat gtcctctatg 17280

agcaaaggag ctgggacagc acaagttact ccctttccct tctggcccaa gtctcttcag 17340

agagagaccc agctcaagcc ccacatgcag caaggtccat aaatactcct acccgctggc 17400

atttctgcca tgagagggtt caacactttc actaatgagg ccttctcctc agttcctggg 17460

acccccacag tggacctggg aacatctggg actccagttt ctaaacctgg tccctcgggt 17520

aagtacaaat caatcgcatc tctgttagag catgcctgat gactgtcaac atctctgcca 17580

ttttcactta aataaagata aaaaatccta gtgaatctac ggatgaggag tcatccagca 17640

aacttaattg agtgcctagt ttctgcaggg ctctagggat aagaaagggg acacaaaaca 17700

gttaaaaata tctgctgcaa gaaagcttat tttattgtga gggtgatggg agttggtggt 17760

ggtgaagtta ctggagatga tgacaataag aatggtgatg ctagtgatga tgatggtgat 17820

aaggatgata attatgaaga tggtggtggt gatgatgatg atggtnnnnn nnnnnnnnnn 17880

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 17940

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18000

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18060


nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18120

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18180


nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18240

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18300

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18360

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18420

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18480

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ngctgccagc 18540

cctctcctgg tgctattcac tctcaacttc accatcacca acctgcggta tgaggagaac 18600

atgcagcacc ctggctccag gaagttcaac accacggaga gggtccttca gggcctgnnn 18660

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18720

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18780

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 18840

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnctc aggtccctgt 18900

tcaagagcac cagtgttggc cctctgtact ctggctgcag actgactttg ctcaggnnnn 18960

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnagaat tcagtcgacc taccggcttt 19020

gatgattgct cagttgaact tagaaatgca ctgtctgccc aatggtccag tctcatgagt 19080

gtgactcttt tctgcctctc ttgggtatct gatcaagatg gactcaggaa aagtgctcca 19140

gataactgtc tccaatataa cactgcccct gccatcacac ccaaatgact ggaagtttca 19200

cagggtcatc agcagggatt ggacttccac cccggccatc cctctgaatt ttccctcttt 19260


tctccc acc 


tcccttgccc 


ttaggtgtta 


aaattctcta 


actaagattt 


ctctcaagac 


19320 


aaatgtgcct cattcacttg tttaattccc aattccagct tgtcacctgt ctcaagtcta 19380

ggctgtcctg tccccatgcc atgagaatgc aagaaccaca ctgaaatgtt agaaaaattc 19440

ttttatccac aagtatgctc accgtcccaa gctggacagt agtcagtgca ctcagagaat 19500

ctaagtgtgg cttctcatct gtgtaccagg cttctcattt cctgtgggcc cttcttgtcc 19560

ttccctccgc aatcttggga ctcctcccta gacaaaactt tattattatt cccctcacct 19620

gccctctcca ggcctgaaaa QQatQQqaca Qccactagag tggatgccat ctgcacccac 19680

caccctgacc ccaaaagccc taggctggac agagagcagc tgtattggga gctgagccag 19740

ctgacccaca atatcactga gctgggcccc tatgccctgg acaacgacag cctctttgtc 19800

aatggtgagc aattgtgatg tggttggagt ttcttcttcc ttgctgagca ggcctctact 19860

ctctgtcttg aggtcactct ccctgcctgg ccactggtct tggccatgtt gtctgtattt 19920

gatgattgat atgaacttca ccgtttcttc ttcatcttgt actggagacc ttcatcctca 19980
agtgtacttg tatagaatcc tcaaagccca tgttccctga


20040 


aactccttca 


attacaccat 


ggtagcactg 


accccttttg gtcccccacc 


ttnnnnnnnn 


20100 


nnnnnnnnnn 


nnnnnnnnnn 



nnnnnnnnnn 


nnnnnnnnnn ttcactcatc 


ggagctctgt 


20160 


gtccaccacc 


agcactnnnn 


nnnnnnnnnn 


nnnnnnnnnn nnnnnnnnnn 


nnnnnnnnnn 


20220 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn nnnnnnnnnn 


nnnnnnnnnn 


20280 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn nnnnnnnnnn 


nnnnnnnnnn 


20340 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn nnnnnncctg 


ggacccccac 


20400 


aatatatcta 


ggagcatcta 


agactccagc 


ctcgatattt ggcccttcan 


nnnnnnnnnn 


20460 


nnnnnnnnnn 



nnnnnnnnnn 


nnngactcca 


gcctcgatat ttggcccttc 


aggtaagtac 


20520 




cacctrtatl: 


agagtatgca 


tgatgagtgt caacatctct 


gtccttttca 


20580 


ctcaaataaa 


at:taaaaal!c 


atagcaaatt 


gtacgtgatg atgagtcacc 


caacaaactt 


20640 


ctttaaatac 


ccactctcta 


ccaggcccta 


gagataaggc agggaacaca 


aaagaggtaa 


20700 


aaatctctgc 


cctcaaaaaa 


cttcttttat 


tttgaggatg atgtgggata 


gtggtgatga 


20760 


taaliattact 


aaaaataatil: 


acaataatga 


tggtgatgct tatgaccatg 


atgtgatgat 


20820 




al!aaaaat!aa 


tgatgatgat 


attgatgatg gtagtggttt 


tgacagtaat 


20880 




kyo L.y a wya k 


gatagtggtg 


gtggtgatta tgggaaggat 


gacagtggtg 


20940 


y *-yy ^•s** ^yy 


'tciO'taa't'tal! 


ggtggtgatt 


gacaatgtgg tggtgatatt 


gacaatgagg 


21000 


ataataaliaa 


••Wjy **y y *"yy ^ 


ggttatgatg 


gttaaggatg atgtgatgat 


ggtgttggtg 


21060 


atcacaatac 


*-**y ^y y ^-yy ^ 


gatgtggacc 


gtcatggttg tggttgtggt 


ggtgatggtg 


21120 


ci'taa'tca'taa 



'toa'taa'taaa 


gatgatggtg 


gtgattgtca tgatggtaag 


gatgaaacag 


21180 


taa'tao'ta'tt: 


y y fcy *»y 


ttcctggtgg 


tgatggtgca ggtgatgatg 


tggatgatga 


21240 


y uy a i>y y i. 


y y Lyyciy a L.y 


atagggatta 


tgaatatggt tcgggtctct 


gactggtggt 




yy uy w wyav.Gi 


M ky ancm 


tgatggtcac 


agtgttggtg atgatgatgg 


tggtgataac 


21360 




w|Ci ^ay ^y w 


agtattatgg 


aacacagaac atcaccaaag 


gttatgctca 


21420 


gcatctaact 


attattattt 


agcatgctct 


atgaaaaact ttgatcgtta 


tagtcaaggg 


21480 


aaocataaaa 


accttctatt 


ttatcactct 


ctttaaatct ggttgcatat 


gtttagaaat 


21540 


aaatctatta 


caaactctta 


oict. uy c lv. I. 


CL\m I. L. ^ uy aav. a ^ciy ty ^ u i_ci 




21600 


ccactacagc 


tgccagccat 


ctcctgatac 


tattcaccct caacttcacc 


atcactaacc 


21660 


tgcggtatga 


ggagaacatg 


tggcctggct 


ccaggaagtt caacactaca 


gagagggtcc 


21720 


ttcagggcct 


ggtgagagcc 


ctgcccacct 


cactctgccc tgcccacctt 


gtcttgttcc 


21780 


acctacgtca 


cccattccaa 


ggcatggaag 


aagatctcac ccacctcccc 


tcacctgaga 


21840 


gatagccccg 


ccccctgatt 


acagcccctt 


ccaccttaca tcttcctcac 
tcagccatct tactcacctc cctcttcctc ctccacaggc taaggccctt gttcaagaac 21960

accagtgttg gccctctgta ctctggctgc aggctgacct tgctcaggtg agaactgaga 22020 

ataaccagtc tggctacccc aagtgttccc aggcccaagg agtttcatca gctttcttcc 22080 

ttccctccct atggaagtcc tca^cacaag tggaattcag gcgttggtgg ctccaggatg 22140 

aacatatctg ctgatcctac cacctccccc atcaatcgag agaatttgca gggcccatca 22200 

gccagatcag gcttctactt tggtcatcct tctgaatttc ttacttctcc ctacctccct 22260 

ctccttcagg tgttaaattc tcttccaagg tttctctcaa gataaacatc ccccatccac 22320 

ttgctttcat ccccaattcc agctcttaat atttctcaag tctgggctct cctgtcccca 22380 

taccatgaga atgcaatttt ataaaattct tgtattcctg actctactca cattcccagg 22440 

ctgcctggaa gttggtgcat tcagagaatc ttagtatggc ttctcacctg tctaccagga 22500 

ttctcatttc ctctgtcccc ttcctgtcct gcccccagga atctcaggat gcctccccat 22560 

aggcaatcta tttaatgtca tcccccttat ctgccctccc taggccagag aaagatgggg 22620 

aagccaccgg agtggatgcc atctgcaccc accgccctga ccccacaggc cctgggctgg 22680 

acagagagca gctgtatttg gagctgagcc agctgaccca cagcatcact gagctgggcc 22740 

cctacacact ggacagggac agtctctatg tcaatggtga gcggctgtga tgtggttgga 22800 

gattcttcct ctttgctgga cagcttctta ctctctgact tgaggtcaca ctccctgact 22860 

ggccattgac gtcttggcta tgttgtctgt atgtgatgac tgatgtctga acttcatagt 22920 

ttcttcatct tggactgagt tcatcctcag taccttcttc cctgatctga gggtactgat 22980 

agagaatctt caaaggcccc tgttccttga aacttcttcc attccactag ggtatctgtg 23040 

acccctattt gattccccac ctctccctta acccttaccc actctcctcc ctccttctct 23100 

gtgcaggttt cacccatcgg agctctgtac ccaccaccag cagtgagtat tcaaccgatg 23160 

ctccagtagc cccaattata caccaagcag ggcaggagct gtcctgtctt cctatgcccc 23220 

tatgtcctct tcataaagga aggggctggg agggcacaag ttattccctt tcccttctgg 23280 

ccagctccag agagagaccc agctcaggcc cgatatgcag caaggcctgt aaatagtttt 23340 

atttgctgac ctttctgcca tgagaggctt ggatgcttcc cctgaagagg gtttctctgt 23400 

agctcttggg actaccacag tggacctggg aaactctggg gatccacccc ttctactggt 23460 

cccttgaata agtaccagcc aatggcacct ctgttagagc atggctgatg agtgtaaaca 23520 

tctcttccat tattcagtca aataaagatg gaaattcttt ataaatctag tgatgatgag 23580 

ccaaccaaca aactttattg agcattgtga caagccctgg ggctctgcca aatcctgggg 23640 

atatggcatg gatcatgaaa caattaataa tctctcctct cagagagcta tttttatgat 23700 

gatactgatg gtggcaatga tgatgatgtt gatggtgatt atgaccatga tgacaatggt 23760 
taatgatgat gatggtgatg ttggtaatga tggtggtgat


23820 


tatgacaata atgatggtga tggrgacagg gatggtgatg attatgatgg tggtggtgat 23880

aacaaagtta atggataata ratgaactta ttggctactg aatatgcacc aaagtgctat 23940

gctcagtgtt taactagtac tatttaatat gatttctaaa aaaaatcttg aattattata 24000

ggcagaagaa tcatgggaac cttttatttt gtcactcact ttaagtccta ttgcatattt 24060

tttaagtcaa ttgcaaacac agtttctctg ctttgaacat tgtgtttata tccagtcacc 24120

ccaatagtgc ataaacctgc tgattggagc aactgtgtct tactcccttg tgcttcccta 24180

gtatctgctt caggaccttg tacatggtag atcgacagat ttagatctac aggaaaatat 24240

ggattttccc agggaaggaa ggaatgaagt atgctttctt ataatgtatg gaaactttcc 24300

tcttctgcct tggttcaact ttagtgtctg ccagagttta cactggaaaa ctatatggca 24360

tctgctccac tccctcatcc atgacagaca tcattaattg attgcagcat tcatggcaga 24420

catcaccaat tgataatagc attcattttc tctcagttca aaacagcttc agaatggtta 24480

ccaaaaaaaa aaaattcagt cgctaccaat tcaattggag ctgactcagg attatgggac 24540

agaattcaag agagttaggt tccttgatga tgtgtagtgg ctatttgttt tccggtccag 24600

gctaatnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 24660

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ctttgtgcgg caaagttcag 24720

gggccccaaa aatttctgtg ccccaatcat ggcggaccta ggtttaggca caaattccag 24780

ggattaagtc cctggagatg ttatggcttt tggttttcct agaaaggctc agctcaggct 24840

cagcttggtc atgctgatat cctttcttcc acttggtcga tttggctgtt gatacttatg 24900

tatgcttcac gaagtttttg tgctgtgttt ttcagctcca tcggttggtt tatgttcctc 24960

tctaaactgg ttattctagt tagcaattcc tttaaccttt catcaaggtg cttagctttg 25020

cattgcatta gaacatgctc ctttagctca tcgtactttt ttattgccca tcttctgaag 25080

cctacttctg tcaattcatc catctgatcc tccatccagt tctgcaccct taatggagag 25140

atgttgcggt catttggagg aaaagaggca ctctggcctt ttgggttttc agcatttttt 25200

tgttgattat ttcccatctt caggagtttt agtttcaggc tttgaggctg ctgatccttg 25260

gatggggttt ttatgggggt cttttggttg ttgttgttga tgatgatgat gttattgtca 25320

ctttctgctt gtttttcttt caatagtcag gtccctcttc tgtagggctg ctgcagtttg 25380

ctaggggttc acttcaggcc ctattcatct gattcgctcc catgtctgga ggtgtcactc 25440

aaggaggctt ggagagcagc gaacataggt gcctgcttct tctgggacct ctgacctcga 25500

gggacaccaa cctgatgcca gtaggatcgc tcctgtgtag ggtgtctgac aactattgtt 25560

ggagggtttc gcccagttga ctggcatgga gagcaggacc catttaatga agcactttgt 25620


cccctggtgg 


agagggggtt 


cttcactggg 


gggaaaccac atgtctgggc 
cctcagaact accagaggag aggctaagtc tgctggtcca cagagactac agccatccct 25740
cccactaggg gcccaagccc agggagtcca aattctgtct ctgagcctct ggctggagtc 25800 
tttggagatc ctgcaaggaa gctctgccca ctgaggaagg atgggtcagg gttagccctg 25860 
aagaggcact ctggctgcag actgccacag ccggtgtgtt gggctgtggg gacaagtctt 25920 
gggaccaagc cgtccagcct acccggctct agcaggggaa aagtacagcc tggagctatt 25980 
gaaaggggtg ccgcccttcc cccgcccagg gagcttagcg tgttaggcag ttgtgagtcc 26040 
agtgctggct gtcgcccctt ccccaaggaa caaaaaagac ttagcaggca gccgcagcca 26100 
gtgctggtcg cccctccccc ggggagttcc gtaggcttag gcagattcca gctgtaagaa 26160 
tctgcgtgtt ctggggttgg gacactaggt cccagtggca tgggttcgcg agtgagatct 26220 
tccaatctgt gagttgcaca gttccgtgga aaaagcacag tttccccctc ttgggtagcc 26280 
cgctcactca ccacctccct tggctggaag gagggggttc cccttccccg tgtgtctctc 26340 
aggtgggcca ccacaccaca ctgctcttcc ttctctctgt gggtcactgc cagccttcta 26400 
gtcaattttg atgagggaac ctggacattt tggttgccag gaaggatcac acacttatta 26460 
cagttttttt caatgtgagc ctctgagcgc tgctgcttat agtcgaccat cttggccccc 26520 
agagtcacac atctgttatt ttttgatgtt ttgattgtgg caattcttgc agaagtaagg 26580 
tggtatcacc ttatggtttt gatttccctg gtcattagtg atgttgaaca tttttttcat 26640 
atgttcatta gccatttgta tatattcttt caacaactgt ctatttatgt ccttagccca 26700 
ctttttgatg ggattgtttt tttcttgcca atttgtttga gttcgttgta gattctagat 26760 
attagtcctt tgttggatat atagattgtg aagattttct cccactctgt gggttgtctg 26820 
tttactctac tgactgtgaa ggaaaagtca atttcttata cgaatttgtc tcactcctac 26880 
ttccaaatga gatcctgggg tttttttttt ctgttaatcc ttcacaatac ttctcccact 26940 
tttttgaact catttgttta tattctgttg tctgcttctc ttttatagga atgtgacttc 27000 
ttatgggctt tctctattat accacatatg ggtttttgtt ttgttttgtt ttgttttgtt 27060 
ttgtttttgt cctcggatcc attctccaac ctcctccagc cttcccgtgc tctgtgggat 27120 
agacgtctga ctcatgaaaa ctacatttcc caggctccca tgctaactag cttcctgtta 27180 
ggttcagcca ataggaggca ttggtgggac aatggtgggc ggggctatgg aagggccaga 27240 
gtatttctgt accccgcccc cctgctcccc ttccaatgtt cctggagcgg tgtaggacca 27300 
atactgtata tatggaagga aggcaaggtg gatagattgg aaggaagaag tgacagatgg 27360 
aaagaagaag tgataaatgg caagcgaggc aagggagcag aggatggatg agtggattgc 27420 
aagaaagaaa aaaatggatg aaatataaaa ggagcaggac agatggataa gtagatggaa 27480 
gtaagaaaag actggtgtaa gaaaggaacg attgatgatg gatgatgaat ggatcagtgg 27540 
tgattgggtg aagggatgaa tggatggatg gacagatgga tgaacagatg ggtgggtgga 27600

tagatggatg gatggataaa atgggtaggt ggatggatgg atggatggac agatgggtgg 27660

gtaggtggat ggatggatag atggatggat aagtgaatgg atggatgg^t ggatggatgg 27720

ataaatggat ggatgggtga aaggaaggaa agaagtgaga gaaggaagag gaaggataga 27780

cagatgttag aaggtacaaa tgaaaggaag gaagccagca agaaagaaag gatgcattaa 27840

tagaatgaaa gatggaaggg aagaagaaag gatggaaaga gagaaggaag aatgaacaga 27900

aggaagttca agagtggtga aaagaagaaa ggcagggaga gaaggagaag taaacttttc 27960

ttctagagat ttgtcttaaa ccttagcttg gctggacact gtggttcacg cctgtaatcc 28020

cagcactttg ggaggccgag gcgggtggat catgaggtca ggagatcaag accatcctgg 28080

ctaacacggt gaaaccctgt ctctactaaa aatacaaaaa aaatttagtc aggtgtggtg 28140

gtgcatgcct gtggtcccag ctactcagga ggctgaggca ggagaatggc ataaaacctg 28200

ggaggcagag cttgcagtga gccaagatca caccactgca ctctagcctg ggcgacaaag 28260

tgagactctg tctcaaacaa aaacaaaaca aaaaaacaaa aacaaaaaac aaaaccaaac 28320

caaaacaaaa aaaaaaacct taactcatac tttcataaag ttccacacac agggagtgat 28380

tagaaagcat ttgctgatat atttttatata ataaacatgt acaccatatt gacctgtgtg 28440

cccagcagtg cttacatgat ttacaatgat taacttgttt aagcttcata acaacggttg 28500

aggcaggaaa catcattgtg aaccattgtc atctcatttt acagatgagt aaactgaagt 28560

gctgagaggt tggttatggc tgcaaagatt gttggccatg ttaaccaatg catagaagat 28620

tagcatacct ggttgtgagt gcaggagaga gagagaaatg ggagaaaggc agagaaggat 28680

cgatggggag agaggaagag agagagagag aataaatttt ttaaaaatgt ctagagtcat 28740

gacttccgca tcagtgtggt aatatgcagc ctttaccctg ggaaagatca gaaccattgg 28800

tactttttac agaatcttcc cttcctgcat ttgggtagaa ggaccccatc tggacatccc 28860

aaatcattaa gcacaccctt actggctgct ggagttgtct ccattaaaag tcaccgttgg 28920

gtttattaag aggcggacac agggtcctta gaacacactg cccccacctg tcccacacca 28980

ccccccaccc acccatcatc ctccccaaga gcttcatctc tctctctctt ccccctgccc 29040

tagccggggt ggtcagcgag gagccattca cactgaactt caccatcaac aacctgcgct 29100

acatggcgga catgggccaa cccggctccc tcaagttcaa catcacagac aacgtcatgc 29160

agcacctggt gagaggcctg cctcccgctg cagccctgcc atgcccatcc tagggctgtt 29220

gcctgcctgc ctctgaccaa cccaagctcc cttctccctc tgcagctcag tcctttgttc 29280

cagaggagca gcctgggtgc acggtacaca ggctgcaggg tcatcgcact aaggtgagaa 29340

actcccccac ccacagcgca ccaccaagaa cttagagttc tgactgggag gtccctcttg 29400

ggttggggtg ggctacatat ttttttaaat ctttttatct
atgaagtttc gctctcgttg cccaggctag agtgcaatgg cacgatcttg gctcactgca 29520

acctctgcct cccgggttca agtgattatc ctgcctcagc ctccccagta gctgggatta 29580 

caggcaggca ccaccatgcc tggctaattg ttttgtattt ttagtagaga tggggtgtct 29640 

ccatgttgat caggctggtc ttgaactcct gacttcaggt gatccaccct cctcagcctc 29700 

ccaaagtgct gggattacag gcgtgagcca ccatatctgg ccccattctt tttttttaaa 29760 

tgaatttaag gagtgcaaat gcagtttttg ttacatgcat atattccata gtgaagtctg 29820 

cagacagtag acttccagac agtagcttct ggtgtatcac ccgaatagtg tacattgtac 29880 

ttattaagtg aggttcccca cccttctccc actctcccac ctttctgagt atccagtgtc 29940 

tattattcca cactccaggt ccatgctctc acgtataagt gagaacgtat ggtattccac 30000 

catgagctaa tggacatgga gtccattggc tcccacttat aagtgagagc atgcggtatt 30060 

tgactatttc tgagtttcac ttaagataat ggactcccat tccatccatg ttgctgcaaa 30120 

atacatgatt tcactctttt tatggctgaa tagtatttcg tggtatatat atataccaca 30180 

ttttctttat ccagtcttct actgatggac acttaggttg ggtccatacc tttgctgttg 30240 

aaatagtgct gcaataaaca tacacgtgca ggtgtctttc ttatataaat gatttctttt 30300 

tttctttcct tttttttgat ataacgaatt tcttttattt gggttaaatc ccccaatagt 30360 

gggattgnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 30420 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn tgacctgtcc gtattgatat ataaaatgct 30480 

gcatttaaag tgtacaactt gatattttgg tatacattgt taaatcatgg ccacatttca 30540 

gctaattaat atatctatta tctctacata gttatcatgt ttgtaccctt tgaccagcat 30600 

caccccattt gctcctcctc ccagcccctg gcaaccacca tcctactctc tgcttctatg 30660 

agtctgacaa ttttagattc cacctataag ttagattatg cggtatttgt ctttctgtgc 30720 

ctggcttatt tcacttagcc taatgtcctc cagctccatc tatgttatcc caagtggcag 30780 

gattttcatc tttcttatat atttcattgt atatgtgtat gccacatttt ctttacccat 30840 

tcatccattg aaggtcattt agcttgtttc catatcttgg ctattttgaa tagtgctgca 30900 

atgaacatag gagtgcagat atctctttaa gatactggtt tcatttcttt ctttcttctc 30960 

tttttttttt ttctgagaca gagtctgact ctgtcgctca agctggagta cagtggtgca 31020 

atcttggctc actgcaaact ctgcctcctg agttcaagcg attctcgtgc ctcaacctcc 31080 

cagggagttt tgctcttgct gcccaggctg aagtgcagtg gtgcaatctt cactcaccac 31140 

aacctgtgcc tcccgggttc aagcgattct cgtgcctcag cctcccaggt agcaaggatt 31200 

acaggcgccc aacaccacac caggctaaat ttttttgcat ttttagtaga gacggggttt 31260 

tgccatgttg gccaggctgg tctcaaattc ctggcctcaa gtgatccacc tgcctcagcc 31320 
acaggcatga accaccacac atggcctcat ttcttttaga




LCl Ufil LGl L«^l^\J 


t" *t* n a n r i" a t* "t* 

i. u9^^^ LCIL L 


ctcagagggt 


ccttttctgc 


atctatttaa 


gatcacattt 




L L L L ua L.CI l> I. 


n^nnraaaaa 


tacatgtaac 


ataaaatctg 


ccattttaac 


catttttaaa 


31 con 


Lclv»aa L UU 


C19 Lgav-clX L9 


attatattca 


caatgtcata 


cagccatcac 


cactatttat 


31 ceo 


L LCXaaXclCL 




gtagatcccc 


aacagtggga 


ttgctgggtc 


aaatggtagt 






xxxxxuxgxx 


ttttgagaaa 


tctccatact 


gtttttcatt 


tgaggttgta 


D-i con 
oXOoU 




XXCCCaCCaa 


cagtgtataa 


gagtttccta 


ggccgggcat 


ggtggcttat 




9 C C. 1. 9 LclaX. V. 


CCdgCdCXXX 


gcgaggccca 


ggtgggtgga 


tcatgaggtc 


aggagatcga 


3XoUU 


yav.CaCCCX.9 


9^XaaCaX99 


tgaaaccccg 


tctctactaa 


aaatgcaaaa 


aattagccgg 


31 RAfl 
3JLOOU 


gcgtggtggc 


gggtgcctgt 


agtcccagct 


actggagagg 


ctgaggcagg 


agaatggcat 


3 1 mn 
31920 


9 aaC C C L gad 


ggcggggcxx 


gcagtgagct 


gagatcgcac 


cactgcacac 


ttcaacctag 




mm M #H M ^« 

gcgacagagc 


gagacxccax 


ctcaaaaaaa 


aaaaaaaaaa 


aaaaggtttc 


ctttcagtgc 




atccttgcca 


acxxgagxxx 


tctgggttgg 


tttgcactct 


catggtattt 


actagatact 




xcxccarLiia 


xaxxxxxacx 


caacccatgc 


ccataacacc 


actcctctac 


cattcccacc 




aaccaLgLar 


aagagxxccx 


tttcttgcat 


ccttgccaac 


ttgacttctt 


tgggtcagtt 




xgcacxcxcL 


xggxaxxxac 


tatttacttc 


tccatttata 


tttttagtca 


actgatgccc 




axggcaccgc 


xccxcxgagg 


caggtgctgg 


gtactagagt 


gataagacag 


atgctgtccc 




tgccctcacc 


cagtggagaa 


gaacagatgc 


taaacaggaa 


cataaatatc 


taagtaaaat 


30AAA 


ggcttcaaat 


ggagxaaagx 


gatatgaaac 


ataaataaat 


agcaagtgat 


gggtagagca 




acxxxaccca 


ggaxgaaxcx 


tgggctgtgt 


cccaaatggc 


catgaaaa'^'t 


gttccaggca 




gggagaacag 


catgagaaaa 


ggtcttgagg 


tgcaaatgag 


cttggcatgt 


tctatgaaca 




gcaaagaggc 


cagxgxggcx 


ggagcagaga 


gagagcaaga 


agaaaagaga 


gaaaggatga 


3 7 AAA 


gacxcaagac 


axcagcaagx 


ttgaagggcc 


ttggaggact 


tggatttttt 


tttttaagac 


3^7 AA 


agcxxxgxxc 


xxgxxgccca 


ggcatgatct 


cggctcacca 


caacctccgc 


ctcctgggtt 


397l^n 


caaacgaxxc 


cxcxgccxca 


gcctcccgag 


tagctggggg 


taacaggcat 


gtgcccacca 




cacctggcta 


axxxxgxaxt 


tttagtagaa 


atggggcttc 


tccatggttg 


gtcaggctgg 




xcxcgaacxc 


ccgaccxcag 


gxgaxccgac 


cgccxcggcc 


xcccaaagxg 


cxgggaxxax 




aggtgtgagc 


cactgcacct 


ggcttggatt 


ttttttgttc 


tatattgtgg 


taacatacac 


33000 


atcacattaa 


attgatcatt 


ttagctatat 


ttcccgttca 


gtggcatcaa 


gcacattcac 


33060 


attattgtgc 


aaccatcacc 


actatcatcc 


atctccagaa 


ctttctcatc 


ttcccaaact 


33120 


gaaactccat 


ccccatgaaa 


cactcattcc 


tcatccccct 


cctcaagcct 


ctggcaccca 


33180 


ccattctact 


ttctgtctct 


gtgaatctga 


tgattctgag 
atcatatggt atatgtcctg gtttatactg tatggctggc ttatttcacc aagcataatg 33300

tcctcaaagt tcatccatgt tgtagcatgt gtcagaattc ccttcctttt ccacttgtat 33360 

gtaaatgctg tattgtgttt ctccattcat taggactttg atttttgcag ggagttgtca 33420 

aggggtgctg ggttctgggg cttcaatata ataagagtaa gctaaactgg ttcatttcct 33480 

ccttcgtgga gaccatgttc tggtaggaac aggaacaaat aatttatgat tacatagagg 33540 

gtgaccaggg cagtgacagg ggaagagtgg aggattgtgg gacccagagg aggctcctga 33600 

ccttgcctag gaagatagga ggaggaagag gaggaggaag aggaggagga agaggaggag 33660 

gaggaggagg agggagtcct ctaagctgag acctggagga tgaccaggaa gttatccagg 33720 

taaggagaaa tggggagaag cttccagaca aaagtaacag caattgcaaa gatcctgaga 33780 

tgatagataa ggtcaggtgg agaaagtgca aactgtcaat gagaccaaaa tatggactgt 33840 

gagttgtgca gtgaccacaa gtggagaggt gctaggtggc cttcatcccc caaagctgca 33900 

cctctccctc ctcaggtctg tgaagaacgg tgctgagaca cgggtggacc tcctctgcac 33960 

ctacctgcag cccctcagcg gcccaggtct gcctatcaag caggtgttcc atgagctgag 34020 

ccagcagacc catggcatca cccggctggg cccctactct ctggacaaag acagcctcta 34080 

ccttaacggt gagcagctat cagccccatc tccctgcccc accccccagc ccccactgca 34140 

gtccaggagg gtgtctgttt gccggttctc tagggaaaga cttggggttc aagtcttggc 34200 

attaccactg gccctcccat aaccacaatg caaggttgga ctttgattaa tcccatttta 34260 

cagatgaaga aactgaggct tagacaggct aagcaattta ccttgacagt ggtggaacca 34320 

ggatatgaac tccacttgtc agcattcggt gctatgatcc actccacatg tttaactcac 34380 

agaaga&ect tcctggtggg ggcacttggg ggacaaaaaa cacatttccg gctgtgagca 34440 

gtggctcaca cctgtaatcc cagcactttg ggaggccaag gcgggcggat cacaaggtca 34500 

agagattgag accaccctgg acaacatagt gaaaccctgt ctctactaaa aatacaaaaa 34560 

ttagctgggt gtggtggcgc acgcctgtag tctcacctac tcgggaggct gaggcaggag 34620 

aatcgcttga actcgggagg cagaggttgc agtgagccaa gattgcgcca ttgcactcca 34680 

gtctgggtga caagagtgaa actctgtctc aaaaaaaaaa aaaacaattt cccctccctg 34740 

ctttcttctc accattgacg agggatgggc ttctctccta cctgaggccc cctataccag 34800 

gaagatctat gggatctaat cttcagcgca cactgggcct cagcattggt ctagaactca 34860 

ggataagata gcatttaaga aggcatcccc taaatggggt tctgagaggc aaagcatgac 34920 

cgtggagaat tgacaaaata gctcgccttt catcccctcc accgccaacc caagaacagt 34980 

gcttatcatc atgaccccat gaggtgggca ccccatatca cttatatgag gtacctttag 35040 

gtaggtaccg ggatgtggag agacatcctg ggctttcatt actcttattt tagcaaagag 35100 
ggaatctgag gcacagagaa gggaagggac ttgcccatgc ccacagcgag tttttggcta 35160

gtatgggtct tgatgttctt tctgggtccg tnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 35220 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnt tcctgcgtgg gagatgtgtg 35280 

gatttgattt gtatctggaa agatgatttt ttattggtga caaagcagtt aaagttaatc 35340 

ttcacagttg tgcggagagt gaccacgcga gttagtctta tccttatttt tttgatcatc 35400 

ccgctacaca agacaaagcg aaccgcacag gcaacatcag caggccccat tggtgtgttc 35460 

ccctctatgg gtccatgtgt tctcatcatt agctcccacg tataaagtga gaagatggca 35520 

gtattggttt tctgttcctg cattagtttg gtaaggataa tgacctccag ctccaaccat 35580 

gttcctgcaa cggacatgat ctcattcttt tttatagctg catagtattc catggtgtat 35640 

atgttcctca tgttctttat ccagtctatc cttgatgggc atttaagtag attccatgtc 35700 

tttgctattg tgaatagtgc ttcagtgaac aggtgtcttt atgatagaaa aatttatatg 35760 

cctttgggca tatatgcagt gatgagattg ctgggtcaga cggtagttct gtttttagct 35820 

ctttgaggaa tcatcctgct gctttctaca gtggatgaac taatttacac tcccaccaac 35880 

agtgtataaa cactcctttt tatctgcaac ctcagcagca tggttttatt tctctttatg 35940 

gctgaatagt gttccattgt gcatatatac cacactttct ttatggattc atctgctgat 36000 

ggacatatag gttgattcca catctctgct attgtgaata gtgctgtgat aaacacacag 36060 

gtgcgggttg ggtcttgatg atctcagtta acatccagtc ccttcaactt ggctattgca 36120 

gggagctgtt cccccttgta aactgcacag cttatgtgct tcattttgtt ccttcattta 36180 

gatttaccaa gcagctacta ttaaccaggc cacaatgtgc ctcgccccca ggaacagaga 36240 

taggttacat gtgcatcctg tcctaatgta atctccaggg gggcggagac tgttttgttc 36300 

taccctatat tccccaaatg taaagggagc cttgcacata ctaagccctt aataaacatt 36360 

cattgggtgg aggaatagat tggaggaggc ctggaagggg aggcgggggt tatggatgga 36420 

taggaggata gacttgtgaa cacaaaggta gtgagagcct ctcattggag gcatgctgga 36480 

gacgtgagta gggaagggtc agtgctaatt gaaatatcag gaaattcttt ctagtggtga 36540 

acacatttaa gtcaaatatt agatgataca taaatgtatc cataatctct agatacacaa 36600 

agggaaaggc atccaggcag gggccccata tggacaaagg catggagtat ctgggacggt 36660 

tccaccacct cctcttacgt gtgacttctt tgtttcaagg ttacaatgaa cctggtctag 36720 

atgagcctcc tacaagtacg tgtctttgaa tctagtgccc atttcaatct ccatgggtct 36780 

tggttcaagc ttttctcctc attcatgaag gaaggttgcc ccaaattcgg gctggtcccc 36840 

taggtggtga ggggcattgt ctcagtggga ggaagaatgc tgagtccttg gccctgtttt 36900 

tagacctgca gccatagtct tggctttgtg aattttccat gtccctctgg gttggaggaa 36960 

gaagtttgaa caagcattcc ctacacggga tagaggttga ggtcagatga tgacctctgt 37020 
tagtctgtac cctccttgat aagaaaatct cctccaagtg ccccagcaga ggcttcatgg
tcaagctgca gactctgctg gctactggtt ttggctaaat ttgcccattg cctcatccag 
tgatccactc gtctatcttt ccagccatcc atttttctat ccttccagtc atctctcaga 
caccacctgt ccttccatcc atccatccgt ccatccattt acccatccat ccatccaccc 
cattttcctg accatttacc tcctcgtcct tccttccatc tgtcctttta tccatctatt 
catccatcac ccatcctcct gcccattcac ctgcttgtcc ctcctttctt ctgtccttct 
atacatccat ccatccatcc atccatccat ccacccatcc actcatccac cacccaccca 
tccttctgcc cactcactcg ctagcccctc cttccttctg tccttccatc catccatcca 
cccatcttcc tgcccattca cctgcttgtc cttccttcta tctgtctttt atccatctct 
ccatccattc tcaccatcca tccatccatc cttctcccta ttcactggtt tgtctttcct 
tctgtccttc caaccatcca cccatctctc cattcattct cctcttcatt caccatgttt 
ccttatttct gtctcttcca tccatccatc tatccagaca gacatctcct ccccccattc 
tcctccccat tcactcaatt gtccttcctt ccatctgtcc ttttatccat ccatccaccc 
atccatccat ccatctatcc ttctccccat tcacctgttt gtccttcttt ctgtccttcc 
aaccatccat ccatccatca tccatccatc tatccttttc cccattcacc tgttttgtcc 
ttccttctgt ccttccaaca tccctccatc tctccatcca tcctcctgcc tattcatctg 
cttgtctttc cttccttctg tccttccatc cattcatcca tctgcccatc cacccactca 
tcctcttgcc cattcacctg cttgtccttc cttccacctg tccttttatc catccatcca 
tccatccatc ttgctcactc ctccactcac acaatcactc cttccctcag tctcatttat 
ggcccacctg tgaatggttg tcctggcttg gaccactgat gaagcccagg ggagcttctc 
ccactagtgg tgggcttttg tcctctctga tggactgttc cttccacagc tcccaagcca 
gccaccacat tcctgcctcc tctgtcagaa gccaca^cag gtatttgggg ccatttttcc 
tcctcgaaga ttagaatagc atttcaatca gacacctgcc ctcgtggagt cccagatttt 
atgaaataaa tagaccatca taatgtcaga tgttttgggg tgagatacct ggcatagttg 
ggaaggagga gggctttctg gagaaagttt cacctgaact gagtctttaa ggatgactaa 
gagtgattca ggcaaatagg gcatgaatag tataactgaa agaggggaat ctgtgagcaa 
agcctcagtg gccagaaaca gcatagagta tagggagaag tgagagaaat ttggtttgca 
tgaaacataa agcttaaccc agagtggatg gataagtgag actgaaaggt cagcaggagc 
cagattggga agggccttga atgccaagtc aagaaatttg aacttaacac tgaaggccat 
agggagctgt ggatggtact agagcagggg cagccatagt gagattgtca tttcagaaag 
attcttcttg tgttcagtat agagaatgtc ctttagacag ggcatccagt gagtctgcca 
ggtgctaatc agggtgagag aaaataagac ctgaactggg atagggggag gagagagagg 38940

atatatgtga tgaatattca gtaaagagaa ttggtgttac ttggagggga gaagacacat 39000 

agcttctgac ttgcgatggc cacactcagt ttaataatga gcgcagtctg atctagtctc 39060 

agaccagccc tcagttgcag acgtctctcc tcccctcctg cagcatgggg taccacctga 39120 

agaccctcac actcaacttc accatctcca atctccagta ttcaccagat atgggcaagg 39180 

gctcagctac attcaactcc accgaggggg tccttcagca cctggtgaga ccctggtccc 39240 

agcagctcct ggtgggataa atcctacccc caacctctgt tcctcggctt accctcttcc 39300 

tccttcctct caagctcaga cccttgttcc agaagagcag catgggcccc ttctacttgg 39360 

gttgccaact gatctccctc aggtgagacc acttcctggc catttgccag taacaaccac 39420 

cccttttgtg accacccctt cctcagcttt cccctgctcc tccctccact gctctttacc 39480 

tgcagaggtc tcgggacctc tctagagtcc tcaaatgcct ctctccccag gcctgagaag 39540 

gatggggcag ccactggtgt ggacaccacc tgcacctacc accctgaccc tgtgggcccc 39600 

gggctggaca tacagcagct ttactgggag ctgagtcagc tgacccatgg tgtcacccaa 39660 

ctgggcttct atgtcctgga cagggatagc ctcttcatca atggtgagtg tcaggctgaa 39720 

cttggattta cagtgacttt tggggagttg gtttctttgt ttttgagatg gagtctcact 39780 

ctatcaccca ggctggagtg caatggtgca atcttggctc tgcaacagtg attctcctgc 39840 

ctcagcctcc caagtagctg ggatttacag gtgcatgcca ccacgctcag ctaatttttg 39900 

tatttttagt agagatgggg tttcaccatg ttgcccaggc tggtctcgaa ctcctgacct 39960 

caggtgatcc acctgccttg gcctcccaaa gtgccaggat tacaggcatg agccaccatg 40020 

cccggcccac catgactatt atttgtccct gttgtatgcc ctttcctctc taaaaaaaat 40080 

agcccaaggc ctggctgggg gacacccttc cccaaaccac caaggggagg gtctttccca 40140 

ttattttgag taaatagcat gaaattcttt gaccaaatta atgtcataaa ttgtttgtct 40200 

ctttctcctt cacttttgtt tccaacttgg ttgcggtata actatcaaat acaattgtat 40260 

gtatttaaga tgtataatgc agtgatttaa tatatgtgta gcttatgaaa tgattaccat 40320 

gatcaaatta gttaacactg ctttcatgtc acatagttac cgtgtgtctg tgtgcgtctg 40380 

tgtgagttag agagaaagag aacatttaag gtctaccctc atagaaaatt tcaggtttac 40440 

aatacagtat tattaactat aatcatcaag ctttatactc gatccccaga acttattcat 40500 

cttgtaacta aaagtttgta ttttgtgacc aacatctccc cattttctct atcaccaccc 40560 

ccatgccccc agcccctgat aaccatcatg ctactctctg cttctgtaag tttgacttct 40620 

gatcccacat ataagtgaga tcatgcagta tttgtttctc tctatctggt atatttcact 40680 

tagcataatg aacccccccc aggtacatcc ataatgaatt tcaattcaaa acccaagtgg 40740 

ctgagtcgtg gcatcctttg ggacaggata gcaggtccct tctatataag gatcctctgt 40800 
gtcagtggtt attaccaggg gacaattctg cacttctgcc ccaccccacc ccccaactgg
gagactctag gcaatatccg aaatcatttt tgggtatcac aactcaggga gggaaggagg 
gtgcaactgg cacctagtgg gtcggtagcc cattttccag tgcacaggag acaaccaccc 
cagggaatga tccagcccca aatgccaata atttcaaggg tgagaaatcc tgttgtacat 
ggtctcaaag ttcttaggtg ggcacaaggc tgacatttat cacactttac tgtaattact 
tgttaaattt atctgattcc cccttaccct gtgaactcaa caaaattacg gtctattatg 
agtgccactg taccctcggt tcgcagtaca tcagcacatc atagtatgga aagaatcatt 
gaatgagtga gcaaattaaa gatttgtgtc tctgctgtaa ctcacattca ttaattcatt 
cattcagcaa acatatatgg gtggctgttc tgccccaagc cttgtactgg gtctggagat 
agaagacaca tttttctgtc tctgaaaaac tcatactcaa gttaacaaca aattacgggc 
acaacaaaga ccccactgct gttattaaca gggtactatg ggagctgaga ggaggagtaa 
attaaggagg gcttcctgga ggagggtgtt atatacccgg ccctgtgccg ggacacataa 
tgataagaca gacttgggcc tctgctgtcc tggagctccc tctcactggg ctcttgaagc 
gtgagcagga gttttgcagg aaatgaaaag gatgcattcc tagaagtggg aactgcatag 
cacatgcagg aaagctcagc tcagaagaat ctgtgtaata ttccattttt ccctctcttt 
ggggcaactt tctgtctaag agctcctgca atgcccagcg tgtggacctg aaattgattc 
tgacagtagg caggggactg ctgggcaact ttggctctgc attttgtgat caacatttcc 
ccaccatatg ttgccttttc ttcttctctg tggctccagg ctatgcaccc cagaatttat 
caatccgggg cgagtaccag ataaatttcc acattgtcaa ctggaacctc agtaatccag 
accccacatc ctcagagtac atcaccctgc tgagggacat ccaggacaag gtggggcatc 
tctcacccct cccgtcttct ctgtcctgtg tgcttctctc cctcttctac ctgatttctc 
tgttaagtga tcactttaaa tgcttcactt cactatgtat tctgggttct ctctcagttt 
ccaaaagtac tctcttgact accattccca tttcacagat gggcaaactg aggctcagaa 
aggggcgtgg tgtgcctagg gtcatacagt gctttaggaa cagagttagg atttgaactc 
tggtcccctt tgctccaagt cctgtgtttt tttccactgg catcagcggc ccctccaccc 
ccaagaggcc tccatctcac ccactctccc tacccatctt tctaggtc 

<210> 3 

<211> 14277 

<212> DNA

<213> Homo sapiens 

<220> 

<221> misc_feature

<222> (1)..(66)

<223> Exon C1

<220>

<221> misc_feature

<222> (1802)..(1947)

<223> Exon C2

<220> 

<221> misc^feature 

<222> (4198)..(4350)

<223> Exon C3 

<220> 

<221> misc_feature

<222> (4679)..(4747)

<223> Exon C4 

<220> 

<221> misc_feature 

<222> (6811)..(6978)

<223> Exon C5 

<220> 

<221> misc_feature 

<222> (11232)..(11270)

<223> Exon C6 

<220> 

<221> misc_feature 

<222> (11594)..(11677)

<223> Exon C7 

<220> 

<221> misc_feature

<222> (13375)..(13500)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (14095)..(14187)

<223> Exon C8 

<400> 3 

accacactct acaaaggcag tcaactacat gacacattcc gcttctgcct ggtcaccaac 60 

ttgacgtaag ttctgaaggt cataagcagt gaccaagctt gtggctgtgt ctctgagcac 120 

ccttgagcta gacgtcccca gtggggtacc cattctcccc tacatccctg tctagctaat 180 

cctaccatct cctcccataa atcctcaagg tagggagtga ggattaacct catggggcca 240 

ccaactccca gcatacacct tctttttttt ctggacactt gggaaaatat aactttttga 300 

tgtagaactc aaaatattag cccaataata atatttaaca tcaaccagcc tcctctcatt 360 

taattctcac aacagaatct atgagttgag tgcaaaaatc atccctattg tgcagatggg 420 

aaaactgagg gtcagaaaag tgaacttccc aagaactgtc aaagttggga tttgaaccca 480 

ggtctctgat gactggatga aggaatgaag atacctatac ttgggaatga ggagggtcga 540 

caggacacga gggctgactt tgtatatttc taaacttcaa agattttctg tatttcagct 600 
gggaatatgg tagaaggtta attggaacaa aaaaatgcaa agcaatgaat aagacctcag 660

tatttgctat gcacaacagg gtgactgtag tcccacaaat aacttcactg tacattgtta 720 

aaatataact aaaggtgtat gcttggattg tttgcaacac aaaggatata tgcttgaggg 780 

gatggatacc ccatttaccc tgatgattat tatgcattac atgcttgtat caaaacatct 840 

catatacccc ataaatataa aaacacctac tatgtacccc aaaaaattaa aaacaaataa 900 

aatcaaaagt aggctgggca cagtggctca tgcctgtaat cccagcactt tgggaggcca 960 

gggcaggtgg atctcttgag cccagaagtt caagaccagc gagggcaaca tagggagtcc 1020 

tagtctctac aaaaaataca aaaattagcc aggcatggtg gcacacacct gtagtcccag 1080 

ccactcagga agctgaggtg ggaggatcgc ctgagcctag gaggctgtac tccagcctgg 1140 

gtgacagagc gagactctat ctcaaaaaat aaaataaaat aataaaaagt agaaatcaag 1200 

agggaaaatg tgggagaaat tgggataatt ttaacaatac cttccaccag agtgatgatg 1260 

aagaatgcat aagtcacttc ttagtggtct tgatctataa aaagtgttca ataaatatcg 1320 

attattgtta ctgttattgc ttctagacgt aattcctgga agcatttttt tttttttttt 1380 

ttttgagatg gagtcatgct ctgttgctca ggctggagtg cagtggtatg atctcggctc 1440 

actacaactg cctcctgggt tcaagcaatt ctcctgcctc agccccccat gtagcaggga 1500 

ctacaggcat gcgccaccac acccggtgaa gttttgtatt tttattagag acagggtttt 1560 

gccatgttgg tcaggctggt ctcgaactcc tgacctcagg caatttgcct gcctcggcct 1620 

cccaaagtgc tgagattaca ggcttgggcc actgcatcca gccgaaggcc tcccattttg 1680 

atcagaaccc ttctctagac tgagggtggg tgcctctaga tcttttgctc tttaaagaca 1740 

gcaaccgatg accctgctga tgctgagtac tggctgaatt cctgtggtct ctgtccctag 1800 

gatggactcc gtgttggtca ctgtcaaggc attgttctcc tccaatttgg accccagcct 1860 

ggtggagcaa gtctttctag ataagaccct gaatgcctca ttccattggc tgggctccac 1920 

ctaccagttg gtggacatcc atgtgacagg tacaaggtgg ggtggctggt ttcctaactg 1980 

gaagaggtgg ggttatgagg aaagatgggg cttctcggta ccagtggaat tggtggaggc 2040 

tctagagagg gaaagggagg ctttctggag acccatgtag gtgacctctg gcagtagatc 2100 

atccaacgag gcaggaacag aacaccagcc attgcatcta agagaatagc tatttttaca 2160 

tgtaaaaaga attgtgttga atgaatgaat caatagatca tttattttga atcaatttat 2220 

tgattcattc atttaattaa tgaataataa atgattcagt acataattga ttaattgatg 2280 

taattgagaa ttgatttaat tgattaattg atcaattaaa atgatcaatt aaatgaatga 2340 

atcagtaaat gaataattca ttcattcaat aaacaatgga agtaggccgg gcatggtggc 2400 

tcacgcctgt aataccagta ctttgggagg cccaggcagg cagatcacga ggtcaggaga 2460 

ttgagaccat cctggctaac acggtgaaac cctgtctcta ctaaaaatac aaaaaaaatt 2520 
agccaggcat ggtggtggcc acctgtagtc gcagctactc gggaggctga ggcaggagaa 2580

tggcgtgaac ccgggaggca gagcttgcag tgagccgaga tcgcgccact gcactccagc 2640 

ctgggcgaca gatggagact ctgtctcaaa aataaataaa taaataaaaa taaaaaataa 2700 

ataaacaatg gaagtaaaca cgtactgata acacagtgtg atcattgcta tgataaggga 2760 

atttcagggg cctgtgggag ccccaaggag gaacacacaa ccttgtcttg gaaagtttta 2820 

tgtaggaagg ggtgaagaag ctgagatctg acagagaatg ggacctagcc aggggtaata 2880 

gatggagaat tgtgctccat gcatctataa cctagaagat agaaagaata tggcatctgg 2940 

ccgggtgcgg tggctcacgc ctgtagtccc agcactttca gaggctgaga tgggtggatc 3000 

acctgaggtc aggagttcaa gaccagcctg accaatatga tgaaacccca tctctgctaa 3060 

aaatacaaaa attagccagg catggtggtg cgtgcctgta atcccagcca cttgggaggc 3120 

tgagagagga gaactgcttg aactcgggag gcggaggttg cagtgagccg agattgtgcc 3180 

attgcactca agcctgggca aaaagagcaa aactgcattt caaaaaaaaa aaaagtggca 3240 

ttttggggca agtttaagaa gattggtgta gctggagcat ccactttgat actggagagg 3300 

tgacagttga agccaaagat gtgggcagag actttgttgg gcactggaat ggcttgggga 3360 

ggaacatgac acactcatga gttctgcttt agaaagaaaa tgaaatgaat tctgctcatc 3420 

ctctgggtgc tgtgtgcaga atggagggtg gggggagaga agagcaaagg caagaagacc 3480 

ctttaggaac aatgatcatt agttagaaga ctctgggttt ctcagcacct gcaattgctg 3540 

actacacccc cagagaaacc cagtctcttt tcccccatgt tgtagagaat tcttacaatg 3600 

cttggtagaa agagaattga acaggtagat gggtggatgg atacaagctg gacagatgga 3660

tggaggaaga tcctccatcc aatatagagc tgttacctaa aaccctcca. cccacuttta 3720 

aaatcctagc tcagccaggc gcggtggctc acacctgtaa tcccagcact ttgggaggcc 3780 

aaggcgggtg gatcacttga ggtcgggggt tcgagaccag tctgaccaac atggtgaaac 3840 

ccccttctcc actaaaaata caaaaaaaaa aaaaagttag ccaggcaggg tggcgcatgc 3900 

ctgtaatccc gctactcggg aggctgaggc aggagaatgg cttgcaccca ggaggtggag 3960 

gttgtggtga gccaagatca cgccattaca ctccagcctg ggcaaagaga gtgaaactgt 4020 

ctcaaaaaac aaaacaaatg acccccctgc caaaaaaaaa aaaaaaaaaa aagaaaagaa 4080 

aaaaagaaaa gcctagctca gctcacactg tcaggaataa gtaagctagc tggaatcatc 4140 

tctttcttaa aaccctgcct tgatagtgga tttttacata cttttttttt aattctagaa 4200 

atggagtcat cagtttatca accaacaagc agctccagca cccagcactt ctacctgaat 4260 

ttcaccatca ccaacctacc atattcccag gacaaagccc agccaggcac caccaattac 4320 

cagaggaaca aaaggaatat tgaggatgcg gtgagaaggg ggtggtatgt ccactctgtt 4380 
gccatgcaga aactgactta tgcatactgg gtagccacag ggtgactttt tataacaatc
cacaaagaca ggttcttatt cccatttaat acacaagcac agagaggttc agtagctgac 
ccaaggtcac acagctaagt cataccctag aagagcatgt cctttgatat acatacctgg 
gcaagtggtt gtcatgacaa gaagcaaaat agacggagaa gtgtgctcag tggctgaaaa 
ttctctgatg ctactggggc caggattctg acctaagaaa catcgccctg tctttcagct 
caaccaactc ttccgaaaca gcagcatcaa gagttatttt tctgactgtc aagtttcaac 
attcaggtaa gttctaactc aggacctaat gactctagga acttctgctg tcctttaaat 
agaagtgtcc ccaagccata gctttgatgg aagagagccc tagaaataga gagctgttaa 
ctaaaaacta gctttttcct aaagctggag cccaactggc ttcaacactc aagagagctg 
gtgtaaatct cagcagacat aaaggtacct ggtgctgagg ccatggagtc tagagtgtag 
aatctactac attaagacat cagctactga aatcaggacc catggaagac gggggaagga 
ggggactaaa accagattac ttagaatcta gcagcctaac tgtgcttttc aatgagaggt 
atcatttcca atggtggggg gtaccaatga tttttttttt ttgacaactg ccttgagaac 
aggctttcct cactaaacaa attctgaatc agaacaaata aagataagcc ctgagaatag 
ggctttttca aggagctgcc aaacagatca aatagtgact atgttctgca gattgatgtc 
tggagaactc tacagctatt ttgactgcta ggcagctggt tttcacagat atcatgattc 
tgaggctgcc agttttcaaa gttaccgagg atcttgctgg atgcagtggc ttgcgactgt 
aatcccagcc ctttgggagg ccaaggtggg tagatcgctt gagctcagga gtttgagacc 
agcctgggca atatggtgaa aacccatctc tacaaaaaat acaaaaatca gctgagcata 
gtggcat <tg ctgtagtccc agttacttag gaggctgagg tgggaggatg gcttgagccc 
aggaggcaga ggttgcagtg agctgacatt gtgccatgca ctctagcctg ggcaacagag 
ccaaagcctg tctcaaaaaa aaaaaaacaa ataataataa taataaaata ctgaggatct 
tgaaagagca ctgtggaaat aatgcaagtt aaaatgccac aaagcttgct ctttttactg 
agatttaaca ctttccttaa ctaaacaccc ctcgaatttt tgcaagcctt tggttcactt 
ctagacttct ggaaaaattg atttggacta ttttggccaa tgttctcatt gattttatgg 
gtattcagaa gttgttaccc caacattcca gaaatgttct ccctgtggct attactttat 
ttatttattt atttatttat ttatttattt atttgagacg gagtctccct ctgttgccca 
ggctggagtg cagtggcgca atctcagctc actgcaacct ccgcttccca ggttcaagcg 
attctcctgc ctcagcctcc caagtagctg ggattatgga tgtgcaccac cacaccggct 
aatttttgtg tttttagtag agatggggtt tcactgtgtt ggccaggctg gtctcgaact 
cctgatctca agtgatccac ccgccttggc ctcccaaagt gctgggataa caggcatgag 
ccactgtgcc tgacctccct gtggctattt ttaaatgaat taagtggaat aaaattagaa 
attcagttct tctcccacgc tagctgcatt ttaagcattt aataacaaca tgaagctact
aatggctgca ttgtgtagtg cagatgtaga attttttttt tgttttttgt tttgtttttg 
agatggagtc tcgctctgtc accaggctag agtgcagtgg cgtgatctcg tctcactgca 
atctctactc cccgattcaa gtgattctcc tgcctcagcc tcccaagtag ctgggattac 
aggcacgtgc caccacaccc agctaatatt tgtatggatg gtctcaatct cctgacctcg 
tgatttgtat ggatggtctc gatctgacct catgatccgc ctgcctgggc ctcccaaagt 
gctgggatta caggcgtgag ccactgtgcc cggccgacat agaatgttta catcattgca 
gaaagtttct gcaggaagag cctagaagga gaaagcctag aatcatgata aaattgcaga 
tatctttgct tatccctgtc cccttccagg tctgtcccca acaggcacca caccggggtg 
gactccctgt gtaacttctc gccactggct cggagagtag acagagttgc catctatgag 
gaatttctgc ggatgacccg gaatggtacc cagctgcaga acttcaccct ggacaggagc 
agtgtccttg tggatggtaa agctccctgg gtcattggga ctgaggtgga agctcccact 
tcctcacctg ggtccttccc tgggaatctg aaggcttggg gttgattcgt catcgagctt 
tctcagactg ggagaaagtg gcttagttct cctaagcttt acccatcatt gaaggaaaga 
aaaggacgcc cgagggatat gggaggcatt tgccctcttc tggccagctc tgtgacctca 
ggctagtcac atctcctttc tggacttctt atctctctgt acttagcaag ccacttggtt 
tttggttccc atcttgcctg ccctagatgg tattgctcct ccacccccag gcagctgcag 
tgttaaacaa ttaccctgat tagttattgt tgttgtgttg tttgtttgtt tttgagacag 
ggtctcactc tgtcacctag gctggagtgc agtgacatga tctcagctca ctgcaacctc 
aacccctgga ctcaagcaat ccacccactt cagcctccca agtaactggg actacagcca 
tgcgccacca cacccggata atttttgtat tttttctaga gatggggttt tgcaacattg 
cccaggctgg tcttgaactc ctgagctcaa gcatgccacc tgcttcagcc tcccaaagtg 
ctgggattac aggcaggcag gcaccactgc agctggttct ggttttttgt gtttgttttt 
ttcttttaga ggcagggtct cgctctgtta accagaatgg agtacagtgg tgcaatcata 
gctcactgca gtcttgaact cctgggctca agcgatcctc ccacctcagc ctcctgagta 
cctggaacta caggcacgtg tcaccacgcc ttgctaattt ctaaattttt tgtagagaca 
gggtctcact atgttgccca gactggtctc taattcctgg ccacaagtga tcctcctgcc 
tcagcaggtc aatgagggct tccagtttca agttgtatgt gattcatcct caacaaatgt 
ggtaggatgg acctattttc caactccaga gatggcttca aggtggctca actttgcata 
tccaatttta cccattcaaa gaatagttat atacattgta ccatgtatca ggaatataac 
agagagtaac tgtttgctct ttcaccacta tattccaaga accccatatt ctgcctggca 
cataataaac actcaagtca tatttgcaga aggaataact agatttcata caaggttctt 8220 

ttcaagtcaa atgcgaataa cgttttagac gggaccttcc aatgcctgtg tgcactgtcc 8280 

ttgattccga attattgttg tgcaagagag cactgttgat ccttcagaat caacaagcct 8340 

ttcacatgcc tgtcacaggt ttttcttttt cttgttttac caattttgtt tgttgtttgt 8400 

ttgttgttat tgttttgttt tgtttttgtt ttttatttgt ttttattttt tctttttttt 8460 

tgagacagag tctcgctctg tcacccaggc tggagtgcag tggcacgatc tccgcccact 8520 

gcaagctccg cctcctgggt tcatgccatt ttcctgcctc agcctcctga gtagctggga 8580 

ctacaggcgc ctgccaccat gtctggctaa ttttttttgt atttttagta gaaacagggt 8640 

ttcaccatgt tgaccaggat ggtctcgatc tcctgacctc gtgatctgcc cacctgggcc 8700 

tcccaaagtg ctgggattac aggcgtgagc caccacaccc agccccaatt ttttttttaa 8760 

ttaaaattgt tgtcagctca caagctttct aaaaacaggc catggaccca gcatcgctgt 8820 

agtttgccaa acccttgcct tgaatcagta ccatccaata gaactttctg cagtgataga 8880 

aaatgtttct atctgtgcta ttcagcacaa agccatgtgt gattactaag cttgaagtgt 8940 

ggttaatgta actgagatac cgaagtttta attttattta attttaattt aaaaagccac 9000 

ttgtggctgc tccatattgc acactacttt ttaaaattat tatttgtata tatttaaggg 9060 

gcacaagtac aattttgttg catggattta tagcccagtg gggaagtctg ggcttttagg 9120 

gtatctatta cctgaataat gtacattgta cccattgagt aatttctcat catccactct 9180 

cctccactcc ccaacccttc caagtttcca ctgtctatta ttccactctc tatgtccatg 9240 

cctatgcatt atttagcatt gacatgtcta tgcattattt agtcaaatac atgtgctatt 9300 

tgacttcctg tatctgagtt gtttgactta agataatgac cttcacttgc atccatgttg 9360 

ctgcaaaaga catgatttca ttctttttta tgcctgggtg gtattgcatt gtgtgtgtgt 9420 

gtgtgtgtgt gtgtgtagag agagagagag atcacatttt ctttatacag tcctccattg 9480 

atgggcactt aggttgattc catatctttg ctattgtgaa tagttttgtg ataaacacac 9540 

aggttcaggt gtctttttga caaaattatt tattttcctt tgtgtagata cccagtcgtg 9600 

ggattcctgg atcaaatggt agtttcattt ttagttattt gagaaatctc cacgtttttc 9660 

atagagatta tactaaatta cattcccacc aacagtgtgt aacggttcac ttttcttgca 9720 

tcctttttaa catctgttat ttttgtcttt ttagtaacag ccattctgac tggcgtaagg 9780 

tggtatctca tcatggtttt aatctgtatt tctctgatta ttagtaatgt cgagcatttt 9840 

ttcatatgct tgttagccat tggtatgtct tctacatctt taagaagctg gctatgggct 9900 

gggcgcagtg gctcacacct gtaatcccag cactttggga ggccgaggca ggcggatcac 9960 

gaggtcagga gttaaaaacc agcctggcca acatggtaaa accctgcctc tactaaaaat 10020 

acaaaaaatt acccaggcat ggtggtgcgc ctgtaatccc agctactcag gaagctgagg 10080 
caggagaatc acttgaaccc aggaggcgga ggttgcagtg agacgagatc acatcattgc 
actccagcct gggtgacaga gtgagactct atcttgagaa aaaaaaaaag ttggctataa 
cagggttgta gaagtagagg aaccagtaac ccttctcgcc atgcctgatg atggctttac 
atccctgtct tcatggagtt tatgctgtcg tgaggaataa caagaacagg cagttgtcaa 
ttataaatta tttgatgtga acctattcat acatgggtgt ggtcatcagg gaaggcttcc 
tggaggaaat gacattgaag gtgaattcta aaagatgacg ataaaccacc aagtgaagga 
gagcttaaat gtgtttttag gcagaagaaa aaccttttgg gtgaaaattt taaaacttag 
agaggtccca tcagtttcca actgcgatga tccattctct ccaccactgc ccttgggccc 
agcccaattt aggtccacca tgcccagagg catgaattta acttatgaca ctcttgtggt 
ggaataatgg ctttgggctt atgtagccat gtgtcatttt tttagagata caaattgaaa 
tatttggggt gagatgtcat ggtgtctact ggcctctaaa acttcagtga aaacatttac 
tttcactgaa atgtcaataa atcataaatt ggatgtatat gttttagttg gaggaaatat 
aaaccactaa atctaggtga tgcatattta ttatactctt ctctctgctt ttttgtacgc 
ttgtaaaatt gtatttaaaa gaataagaca cacttggccg ggcgcggtgg ctcacgcctg 
taatcccagc actttgggag accgaggtgg gtggatcatg aggtcaggag ttcaagacca 
gcctggccaa catggtaaaa ctccatcact acatacaaaa attagccagg cattttggcg 
ggcacctgta atctcagcta cttgggaggc tgaagcagga gaattgcttg aacccgggaa 
gcagaggttg cagtgagcca agatcacgcc actgcactct agcctgggca acagagcaag 
actccatctc cagaaaaaaa aaaaaaaaaa gacacactca catgcaccct ccatttcttt 
catttctagg gtattctccc aacagaaatg agcccttaac tgggaattct ggtaagtctc 
aaagaagccc cagcccaggg tagggagggg gtagcctgat ggtgctttgc cttgtccaag 
agcaccaggc acacagagtc ttggatgagg atcaaaattg ccaacccatg gcaaagacta 
ttgaggcata gtaaagggat agcagggatc ctggctttct gggggcccag tttttggggg 
catcagaggc atgaggtgtt gagccactaa gctctcttcc ccaggggctg tgcccatcct 
caggccacat agggtccaag aaggagccct gggacgtggc aggaggtggc tcaccccagc 
ccttgtctcc ccagaccttc ccttctgggc tgtcatcctc atcggcttgg caggactcct 
gggagtcatc acatgcctga tctgcggtgt cctggtgagc aaggaagggt tgcttgtctt 
cttaacaatt gggttgtaag agttcttaat atattataaa accatactat actatacaca 
agtcctttgc tggatatatg ttttgcaaat attttctccc agttcacgga gtggctttcc 
tattttcttt ttataatttt atttttaatt aattgacaaa taatgaatgc atatatttag 
gggatacaat gtgatgcttt ggtatatgta caattatgga atgactcaat caagctaatt 
aatatgtccc tcacctctca tacttattat ttctttgtgg tgtgaacatt ggcaacctat 12000 

actcttagca attttgaaat ctacattatt attaactata gttactatgt tatgcagatc 12060 

tcaaaaactt cacaacctat atgctgatta caagatattg agagaaaaag tgattgcaaa 12120 

gagtgtaaat aaaataatgt aagagggaaa aatgtaacaa aattagtcgt tagggaaatg 12180 

tacacggaag tcacaatgag aggccacttt tcacaagaat ggataaaatt gaaaagattg 12240 

actataacaa gtgttggtga aaatgtgaca gaactggaac tctcataaag tgaaagtgga 12300 

aaatagcttg gccatttctt tgaaaattac acacacctac cgtaagacct accatcccac 12360 

tactagtaat ttatctaaga gaaataaaaa catatgtcta tatgaagact tgtacacaag 12420 

taaatgttca taacagcttt gtttgtaata gccaaactct gaaaacaagc ccctaatgtc 12480 

cattaacaaa tatatcctga caatggaata ttattcagca acaaaaagga attattaata 12540 

cattaataaa ttatacagca acatgtataa attgcaaaat agttatgcct agtgaaagaa 12600 

tccagatgaa gaaaagagta catgccatat gattccctta atagacaaat tctagaaaat 12660 

acaaactaat ctgtaaggac aggaatcaga tcagcggttg cctgggaatg aaaatgtgtt 12720 

tgcagtggca gggaaaaagg aattgtaaaa gagcaggaag aaagtttttt tgttgttttt 12780 

tttttgtttt ttcttgagac agagtcttag tctatcgccc aagctggagt gcaatggcac 12840 

gatctcagct cattgcaacc tctgcctctc gggttcaagc gtttttcctg ccccagcctc 12900 

ccaagtagct gggattacac atgcgcacca ccacactcag ctaatttttg tatttttagt 12960 

agagacgggg ttttaccatg ttggccaggc tggtctcgaa ctcctgacct caggtgatcc 13020 

acccgccttg gcctcccaaa gtgctgggat tacaggagtg agccaccatg cctggccagg 13080 

acgaaagttt tggggatgat ggatggatgt tccttatgtt gattgtggtg acgattcaat 13140 

aagttatgat cagaacttat caaaacattc actttaaatg tgtgcagttt attttatgtc 13200 

agttatgcct cagttaagct ggacagatgt agaggaggaa gggagggaga gagggggctg 13260 

agatcaggac caaaagccag agagaaagag actgagaatg agatgagaga gaaatggtat 13320 

ttagacagaa gacaggcgat agatgattga tagttgacag atgattggtg gatannnnnn 13380 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13440 

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 13500 

aggaggttta aacaaaacgc aattatgttg aaatgacaat gattgtggat ataaaggtag 13560 

atagaaatag atatttgtga agataatggt tagataaaaa tgataggtaa cagatattga 13620 

tagatcttga taagtagatg ataaatacat gattgatgga tgacaggtga ttgatagatg 13680 

atttgatgga ttataaatag gagatgattg agaggtgaga gataattgat ggttatttga 13740 

ttggtagata attgattgac aggttgataa atattgatag ctagatgata gataaataga 13800 

tcattggtag atatgtgata tattgataaa gaaattcaga ggcaaaagga gagagaaatg 13860 
aaggggatat cggaggggga aaaatttttt taaaccgaga gtgaaacaag gagacagaag 13920 

aaaagaaagt ggtgaaaaga ggaaaagaac tgagggagaa attaaatgaa acaatgaagg 13980 

gagacagagg aagcataagg cctctggctt tggccatatt ctcacccctg tggtctcctc 14040 

tccctggacg gctgaccagt ccattctcac gcctcctcct caccctcata ggtgaccacc 14100 

cgccggcgga agaaggaagg agaatacaac gtccagcaac agtgcccagg ctactaccag 14160 

tcacacctag acctggagga tctgcaatga ctggaacttg ccggtgcctg gggtgccttt 14220 

cccccagcca gggtccaaag aagcttggct ggggcagaaa taaaccatat tggtcgg 14277 



<210> 4 

<211> 66765 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> (38757)..(38757) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38775) . . (38775) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38778)..(38778) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38835).. (38835) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38838) . . (38838) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38859).. (38859) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38868) . . (38868) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (38883) . . (38883) 

<223> n is a, c, g, or t 

<220> 
<221> misc_feature 

<222> (38892)..(38892) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41833).. (41833) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41838) . . (41838) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41843) . . (41843) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41847) . . (41847) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41859).. (41859) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41865)..(41865) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41874)..(41876) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41881) . . (41883) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41892) . . (41892) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41911) . . (41911) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41942) . . (41942) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41945)..(41945) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41951) . . (41951) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41986) . . (41986) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41989) . . (41989) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41991).. (41991) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (41994) . . (41994) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (42000) . . (42000) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (42016) . . (42016) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (42022) . . (42022) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (42026) . . (42026) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43909)..(43909) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43911) . . (43912) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43916)..(43916) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (43927)..(43927) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43929).. (43929) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43932).. (43932) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43939) . . (43939) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43947) . . (43947) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43965).. (43965) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43968) . . (43968) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43970).. (43970) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43974) . . (43975) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43982) . . (43983) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43985) . . (43985) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (43989) . . (43989) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (43992) . . (43992) 

<223> n is a, c, g, or t 
<220> 

<221> misc_feature 

<222> (44013) . . (44013) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44377) . . (44377) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44379) . . (44380) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44384) . . (44384) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44395) . . (44395) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44397) . . (44397) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44400) . . (44400) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44407).. (44407) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44415) . . (44415) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44433).. (44433) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44436).. (44436) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44438) . . (44438) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44442) . . (44443) 



022438.43867 (pct2).ST25.txt 




WO 2004/045553 



PCT/US2003/037041 





<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44450) . . (44451) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44453)..(44453) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44457) . . (44457) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44460) . . (44460) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (44481) . . (44481) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47185) . . (47185) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47187) . . (47188) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47192).. (47192) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47203) . . (47203) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47205) . . (47205) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47208) . . (47208) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47215) . . (47215) 

<223> n is a, c, g, or t 



<220> 
<221> misc_feature 

<222> (47223)..(47223) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47241)..(47241) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47244) . . (47244) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47246) . . (47246) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47250) .. (47251) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47258).. (47259) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47261).. (47261) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47265)..(47265) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47268).. (47268) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (47289).. (47289) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48118).. (48118) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48120).. (48121) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48125)..(48125) 

<223> n is a, c, g, or t 





<220> 

<221> misc_feature 

<222> (48136) . . (48136) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48138) . . (48138) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48141) . . (48141) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48148).. (48148) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48156) . . (48156) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48174) . . (48174) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48177) . . (48177) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48179) . . (48179) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48183) . . (48184) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48191) . . (48192) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48194) . . (48194) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (48198) . . (48198) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<220> 

<221> misc_feature 
<222> (48222) . . (48222) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (48226) . . (48226) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48228) . . (48228) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48231) . . (48231) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48246) . . (48249) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48253)..(48253) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48255).. (48255) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48265) . . (48265) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48273) . . (48273) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48278) . . (48278) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48305) . . (48306) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48313) . . (48313) 

<223> n is a, c, g, or t 
<220> 

<221> misc_feature 

<222> (48315).. (48315) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48322).. (48322) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48327) . . (48327) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48340)..(48340) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48356).. (48356) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48362).. (48363) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48365) . . (48365) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48367).. (48367) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48369).. (48369) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48378) . . (48378) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48388) . . (48388) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48393) . . (48393) 

<223> n is a, c, g, or t 

<220> 



<221> misc_feature 

<222> (48398)..(48398) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48402)..(48402) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48414) . . (48414) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48420) . . (48420) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48429) . . (48431) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48436) . . (48438) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48447) . . (48447) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (48466) . . (48466) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49006) . . (49006) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49009)..(49009) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49011) . . (49011) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49014) . . (49014) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49020) . . (49020) 

<223> n is a, c, g, or t 

<220> 
<221> misc_feature 

<222> (49036).. (49036) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49042).. (49042) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49046) . . (49046) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49321) . . (49321) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49326).. (49326) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49331)..(49331) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49335)..(49335) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49347) . . (49347) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49353).. (49353) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49352) . . (49364) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49369).. (49371) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49380) . . (49380) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49399)..(49399) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49430)..(49430) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49433).. (49433) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49439)..(49439) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49474) . . (49474) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49477)..(49477) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49479) . . (49479) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49482) . . (49482) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49488) . . (49488) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49504) . . (49504) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49510) . . (49510) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49514) . . (49514) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49519) . . (49519) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49521)..(49522) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49526)..(49526) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49537).. (49537) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49539).. (49539) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49542).. (49542) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49549).. (49549) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49557)..(49557) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49575).. (49575) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49578).. (49578) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49580).. (49580) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49584).. (49585) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (49592) . . (49593) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (49595).. (49595) 
<223> n is a, c, g, or t 
<220> 

<221> misc_feature 

<222> (49599) . . (49599) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49602) . . (49602) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (49623) . . (49623) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50725)..(50725) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50730) . . (50730) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50735) .. (50735) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50739)..(50739) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50751) . . (50751) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50757) . . (50757) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50766) . . (50768) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50773) . . (50775) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50784) . . (50784) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50803) . . (50803) 
<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50834) . . (50834) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50837) . . (50837) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50843) . . (50843) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50878) . . (50878) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50881).. (50881) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50883).. (50883) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50886) . . (50886) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50892) . . (50892) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50908) . . (50908) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50914).. (50914) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50918) . . (50918) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (50923).. (50923) 

<223> n is a, c, g, or t 

<220> 
<221> misc_feature 

<222> (50925)..(50926) 
<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (50930).. (50930) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (50941) . . (50941) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50943).. (50943) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50946).. (50946) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50953).. (50953) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50961) . . (50961) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50979) . . (50979) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50982) . . (50982) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50984).. (50984) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50988) . . (50989) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50996) . . (50997) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (50999) . . (50999) 

<223> n is a, c, g, or t 
<220> 

<221> misc_feature 

<222> (51003) . . (51003) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (51006) . . (51006) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (51027) . . (51027) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52597).. (52597) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52602).. (52602) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52607) . . (52607) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52611) . . (52611) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52623) . . (52623) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52629).. (52629) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52638).. (52640) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52645) . . (52647) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52656).. (52656) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52675)..(52675) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52706).. (52706) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52709).. (52709) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52715).. (52715) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52750).. (52750) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52753).. (52753) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52755).. (52755) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52758) . . (52758) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52764) . . (52764) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52780) . . (52780) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52786).. (52786) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52790) .. (52790) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52795) . . (52795) 

<223> n is a, c, g, or t 
<220> 

<221> misc_feature 

<222> (52797).. (52798) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52802) . . (52802) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52813).. (52813) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52815) . . (52815) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52818).. (52818) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52825).. (52825) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52833).. (52833) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52851).. (52851) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52854).. (52854) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52856).. (52856) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52860) . . (52861) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52868) . . (52869) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52871)..(52871) 

<223> n is a, c, g, or t 
<220> 

<221> misc_feature 

<222> (52875)..(52875) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52878) . . (52878) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52899) . . (52899) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52903).. (52903) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52905) . . (52905) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52908) . . (52908) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52923) . . (52926) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (52930) . . (52930) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (52932) . . (52932) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (52942).. (52942) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52950) . . (52950) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52955).. (52955) 

<223> n is a, c, g, or t 

<220> 
<221> misc_feature
<222> (52982) . . (52983) 
<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52990) . . (52990) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52992) . . (52992) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (52999).. (52999) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53004) . . (53004) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (53017) . . (53017) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53033) . . (53033) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53039) . . (53040) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53042) . . (53042) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53044).. (53044) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53046) . . (53046) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53055).. (53055) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (53065) . . (53065) 
<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (53070)..(53070)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53075).. (53075) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53079)..(53079)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53091).. (53091) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53097) . . (53097) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53533).. (53533) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53538).. (53538) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53543).. (53543) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53547).. (53547) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53559).. (53559) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53565).. (53565) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53574).. (53576) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (53581)..(53583)
<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53592).. (53592) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53611) . . (53611) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53642).. (53642) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53645) . . (53645) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53651).. (53651) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53686).. (53686) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53689) . . (53689) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53691)..(53691)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (53694).. (53694) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53700).. (53700) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53716) . . (53716) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53722).. (53722) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (53726) . . (53726) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53731) . . (53731) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53733) . . (53734) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53738).. (53738) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53749) . . (53749) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53751) . . (53751) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (53754).. (53754) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53761).. (53761) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53769) . . (53769) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53787)..(53787)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (53790) . . (53790) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53792) . . (53792) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53796) . . (53797) 
<223> n is a, c, g, or t
<220> 

<221> misc_feature 

<222> (53804)..(53805)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53807).. (53807) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53811).. (53811) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53814) . . (53814) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53835).. (53835) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53839)..(53839)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53841).. (53841) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53844).. (53844) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53859)..(53862)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53866).. (53866) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53868).. (53868) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature 

<222> (53878).. (53878) 

<223> n is a, c, g, or t 



<220> 
<221> misc_feature

<222> (53886).. (53886) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53891)..(53891)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53918).. (53919) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53926) . . (53926) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53928).. (53928) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53935).. (53935) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53940) . . (53940) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53953).. (53953) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53969).. (53969) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (53975).. (53976) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53978).. (53978) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (53980) . . (53980) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (53982) . . (53982) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (53991) . . (53991) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54001).. (54001) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54006) . . (54006) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54011) . . (54011) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54015) . . (54015) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54027) . . (54027) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54033)..(54033)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (54042) . . (54044) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54049) . . (54051) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54060).. (54060) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54079) . . (54079) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54469) . . (54469) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54474)..(54474)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (54479) . . (54479) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54483).. (54483) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (54495).. (54495) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54501) . . (54501) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54510) . . (54512) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54517) . . (54519) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54528) . . (54528) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54547) . . (54547) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54578) . . (54578) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54581) . . (54581) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54587) . . (54587) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54622) . . (54622) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (54625)..(54625)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54627)..(54627)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54630) . . (54630) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54636) . . (54636) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54652) . . (54652) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54658) . . (54658) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54662) . . (54662) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature 

<222> (54667) . . (54667) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54669) . . (54670) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54674) . . (54674) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (54685) . . (54685) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (54687).. (54687) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (54690)..(54690)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (54697) . . (54697) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54705) . . (54705) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54723) . . (54723) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54726) . . (54726) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54728) . . (54728) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54732) . . (54733) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54740) . . (54741) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54743) . . (54743) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54747) . . (54747) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54750) . . (54750) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54771) . . (54771) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54775) . . (54775) 

<223> n is a, c, g, or t 

<220> 
<221> misc_feature

<222> (54777)..(54777)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (54780)..(54780)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54795).. (54798) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54802) . . (54802) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54804).. (54804) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54814)..(54814)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (54822).. (54822) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54827) . . (54827) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54854)..(54855)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54862).. (54862) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54864) . . (54864) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54871).. (54871) 

<223> n is a, c, g, or t 

<220> 



<221> misc_feature

<222> (54876)..(54876)

<223> n is a, c, g, or t

<220>

<221> misc_feature 

<222> (54889).. (54889) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (54905).. (54905) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54911) . .(54912) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54914)..(54914)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54916) . . (54916) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54918) . . (54918) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54927) . . (54927) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54937) . . (54937) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54942) . . (54942) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54947).. (54947) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54951) . . (54951) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (54963) . . (54963) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (54969)..(54969)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54978).. (54980) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54985).. (54987) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (54996) .. (54996) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55015).. (55015) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55405)..(55405)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55410)..(55410)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55415)..(55415)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (55419).. (55419) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55431).. (55431) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (55437).. (55437) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (55446).. (55448) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (55453)..(55455)
<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (55464)..(55464)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (55483).. (55483) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55514).. (55514) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55517) . . (55517) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55523).. (55523) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55558).. (55558) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55561)..(55561)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55563) . . (55563) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55566) . . (55566) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55572).. (55572) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55588).. (55588) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55594).. (55594) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55598)..(55598)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (55603).. (55603) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55605) . . (55606) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55610)..(55610)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (55621)..(55621)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55623)..(55623)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (55626).. (55626) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55633).. (55633) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55641).. (55641) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55659).. (55659) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55662)..(55662)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55664)..(55664)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55668)..(55669)

<223> n is a, c, g, or t 



<220> 
<221> misc_feature

<222> (55676)..(55677)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (55679)..(55679)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55683)..(55683)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55686).. (55686) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55707). .(55707) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55711)..(55711)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (55713).. (55713) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55716).. (55716) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55731) . . (55734) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55738).. (55738) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (55740).. (55740) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (55750).. (55750) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (55758) . . (55758) 
<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (55763).. (55763) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55790).. (55791) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55798).. (55798) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55800) . . (55800) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55807).. (55807) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55812)..(55812)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (55825).. (55825) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55841).. (55841) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55847).. (55848) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55850).. (55850) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55852).. (55852) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55854).. (55854) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (55863)..(55863)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55873) . . (55873) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55878).. (55878) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55883).. (55883) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55887)..(55887)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (55899).. (55899) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55905).. (55905) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55914).. (55916) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55921).. (55923) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55932)..(55932)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (55951) . . (55951) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56341) . . (56341) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56346).. (56346) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature 

<222> (56351).. (56351) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56355).. (56355) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56367).. (56367) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56373).. (56373) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56382).. (56384) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56389) . . (56391) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56400)..(56400)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (56419)..(56419)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (56450)..(56450)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (56453) . . (56453) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56459).. (56459) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56494) . . (56494) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56497)..(56497)

<223> n is a, c, g, or t
<220> 

<221> misc_feature

<222> (56499).. (56499) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56502).. (56502) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56508) . . (56508) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56524) . . (56524) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56530) . . (56530) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56534).. (56534) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56539).. (56539) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56541) . . (56542) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56546) . . (56546) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56557).. (56557) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56559).. (56559) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56562).. (56562) 

<223> n is a, c, g, or t 

<220> 
<221> misc_feature

<222> (56569)..(56569)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56577) . . (56577) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56595). .(56595) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56598) . . (56598) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56600).. (56600) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56604).. (56605) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56612) . . (56613) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56615) . . (56615) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56619).. (56619) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56622) . . (56622) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56643) . . (56643) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (56647) . . (56647) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (56649)..(56649)

<223> n is a, c, g, or t

<220>

<221> misc_feature 

<222> (56652) . . (56652) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56667).. (56670) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56674) . . (56674) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56676) . . (56676) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature

<222> (56686) . . (56686) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56694) . . (56694) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56699) . . (56699) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56726) . . (56727) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56734) . . (56734) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56736) . . (56736) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56743) . . (56743) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56748) . . (56748) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (56761)..(56761)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56777) . . (56777) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56783) . . (56784) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56786) . . (56786) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56788).. (56788) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56790).. (56790) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56799) . . (56799) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56809) . . (56809) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56814)..(56814)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (56819) . . (56819) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56823) . . (56823) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (56835) . . (56835) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (56841)..(56841)
<223> n is a, c, g, or t

<220>

<221> misc_feature 

<222> (56850) . . (56852) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (56857) .. (56859) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56868) . . (56868) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (56887) . . (56887) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57475) . . (57475) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57477) . . (57478) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57482) . . (57482) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57493) . . (57493) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57495) . . (57495) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57498).. (57498) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57505) . . (57505) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57513).. (57513) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57531)..(57531)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (57534).. (57534) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57536).. (57536) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57540).. (57541) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57548).. (57549) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57551) . . (57551) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57555).. (57555) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57558).. (57558) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57579).. (57579) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57583).. (57583) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57585) . . (57585) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57588) . . (57588) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57603).. (57606) 

<223> n is a, c, g, or t 



<220> 
<221> misc_feature
<222> (57610).. (57610) 
<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57612) . . (57612) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57622).. (57622) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57630) . . (57630) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57635)..(57635)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57662).. (57663) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57670) . . (57670) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57672) . . (57672) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57679) . . (57679) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57684) . . (57684) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (57697).. (57697) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57713) . . (57713) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (57719)..(57720)
<223> n is a, c, g, or t

<220>

<221> misc_feature 

<222> (57722).. (57722) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57724) . . (57724) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57726) . . (57726) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57735).. (57735) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57745) .. (57745) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57750) .. (57750) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57755)..(57755)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (57759).. (57759) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (57771) . . (57771) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57777)..(57777)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (57786) .. (57788) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (57793).. (57795) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (57804)..(57804)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (57823)..(57823)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58213)..(58213)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58218)..(58218)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58223)..(58223)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58227)..(58227)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58239)..(58239)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58245)..(58245)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58254)..(58256)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58261)..(58263)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58272)..(58272)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58291)..(58291)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58322)..(58322)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58325)..(58325)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (58331).. (58331) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58366) . . (58366) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58369) . . (58369) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58371) . . (58371) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58374) . . (58374) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58380) . . (58380) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58396).. (58396) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58402).. (58402) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58406) . . (58406) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58411)..(58411)

<223> n is a, c, g, or t

<220>

<221> misc_feature
<222> (58413) . . (58414) 

<223> n is a, c, g, or t 

<220> 





<223> n is a, c, g, or t
<220> 

<221> misc_feature 

<222> (58429).. (58429) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (58431) . . (58431) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58434) . . (58434) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (58441) . . (58441) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58449) . . (58449) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58467) . . (58467) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature

<222> (58470) . . (58470) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58472) . . (58472) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58476) . . (58477) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58484) . . (58485) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58487) . . (58487) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58491).. (58491) 

<223> n is a, c, g, or t 



<220> 
<221> misc_feature

<222> (58494) . . (58494) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58515) . . (58515) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58519).. (58519) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58521).. (58521) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58524).. (58524) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58539).. (58542) 

<223> n is a, c, g, or t 



<220> 

<221> misc_feature

<222> (58546).. (58546) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58548).. (58548) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (58558).. (58558) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58566).. (58566) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58571).. (58571) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58598).. (58599) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58606)..(58606)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (58608)..(58608)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58615)..(58615)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58620) . . (58620) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58633).. (58633) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58649) . . (58649) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58655)..(58656)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58658) . . (58658) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58660).. (58660) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58662) . . (58662) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58671) . . (58671) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58681) . . (58681) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58686).. (58686) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58691)..(58691)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58695) . . (58695) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58707) . . (58707) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58713) . . (58713) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58722) . . (58724) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (58729) . . (58731) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (58740)..(58740)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (58759) . . (58759) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature

<222> (59149) . . (59149) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59154).. (59154) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (59159) . . (59159) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (59163).. (59163) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 
<222> (59175) . . (59175) 
<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (59181) . . (59181) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59190) .. (59192) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59197) . . (59199) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59208).. (59208) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59227) . . (59227) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59258).. (59258) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59261) . . (59261) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59267) . . (59267) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59302) . . (59302) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59305) . . (59305) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59307) . . (59307) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59310) . . (59310) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59316)..(59316)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (59332) . . (59332) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59338)..(59338)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59342)..(59342)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59347)..(59347)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59349).. (59350) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59354).. (59354) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59365)..(59365)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59367)..(59367)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59370).. (59370) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59377) . . (59377) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59385)..(59385)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (59403)..(59403)

<223> n is a, c, g, or t 



<220> 
<221> misc_feature

<222> (59406)..(59406)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59408) . . (59408) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59412) . . (59413) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59420) . . (59421) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59423) . . (59423) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59427) . . (59427) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59430) . . (59430) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59451) . . (59451) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (59455) . . (59455) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59457) . . (59457) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59460) . . (59460) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59475) . . (59478) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature

<222> (59482) . . (59482) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature 

<222> (59484) . . (59484) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (59494).. (59494) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59502) . . (59502) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59507) . . (59507) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (59534).. (59535) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59542) . . (59542) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59544) . . (59544) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59551) . . (59551) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59556) . . (59556) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59569) . . (59569) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59585).. (59585) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59591).. (59592) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59594)..(59594)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59596)..(59596)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59598)..(59598)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59607) . . (59607) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (59617) . . (59617) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59622) . . (59622) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59627) . . (59627) 

<223> n is a, c, g, or t 

<220>

<221> misc_feature 

<222> (59631).. (59631) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (59643) . . (59643) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59649).. (59649) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59658).. (59660) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59665) . . (59667) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (59676) . . (59676) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature 

<222> (59695).. (59695) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60082) . . (60082) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60087) . . (60087) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60092) . . (60092) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60096) . . (60096) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60108) . . (60108) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60114) . . (60114) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60123)..(60125)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60130) . . (60132) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60141) . . (60141) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60160) . . (60160) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60191) . . (60191) 

<223> n is a, c, g, or t 

<220> 



<221> misc_feature

<222> (60194)..(60194)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (60200)..(60200)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (60235).. (60235) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60238) . . (60238) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60240).. (60240) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60243) . . (60243) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60249) . . (60249) 

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (60265) . . (60265) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60271) . . (60271) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60275) . . (60275) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60280)..(60280)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60282)..(60283)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (60287)..(60287)

<223> n is a, c, g, or t 

<220> 
<221> misc_feature

<222> (60298) . . (60298) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60300) . . (60300) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60303) . . (60303) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60310) . . (60310) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60318) . . (60318) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60336) . . (60336) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60339) . . (60339) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60341) . . (60341) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60345)..(60346)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 
<222> (60353) . . (60354) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (60356).. (60356) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature
<222> (60360) . . (60360) 
<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60363)..(60363)

<223> n is a, c, g, or t

<220>

<221> misc_feature

<222> (60384) . . (60384) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60388) . . (60388) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60390) . . (60390) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60393) . . (60393) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60408) . . (60411) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60415) . . (60415) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60417) . . (60417) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60427) . . (60427) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60435)..(60435)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (60440) . . (60440) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60467) . . (60468) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60475) . . (60475) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60477)..(60477)
<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60484) . . (60484) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60489).. (60489) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60502) . . (60502) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60518) . . (60518) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60524) . . (60525) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60527) . . (60527) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60529).. (60529) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60531)..(60531)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (60540)..(60540)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60550) . . (60550) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60555)..(60555)

<223> n is a, c, g, or t

<220> 

<221> misc_feature 

<222> (60560) . . (60560) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (60564)..(60564)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60576) . . (60576) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60582) . . (60582) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60591) . . (60593) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (60598) . . (60600) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60609) . . (60609) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (60628) . . (60628) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (61018) . . (61018) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61023) . . (61023) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61028) . . (61028) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61032).. (61032) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61044) . . (61044) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61050)..(61050)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (61059) . . (61061) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61066) .. (61068) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61077)..(61077)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (61096) . . (61096) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61127) . . (61127) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61130) . . (61130) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61136) . . (61136) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61171)..(61171)

<223> n is a, c, g, or t

<220> 

<221> misc_feature

<222> (61174) . . (61174) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61176) . . (61176) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61179)..(61179)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61185)..(61185)

<223> n is a, c, g, or t 



<220> 
<221> misc_feature

<222> (61201)..(61201)

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (61207) . . (61207) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (61211) . . (61211) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62620) . . (62620) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62622) . . (62623) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62627).. (62627) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62638) .. (62638) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62640) . . (62640) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62643) . . (62643) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62650) . . (62650) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62658) . . (62658) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (62676).. (62676) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62679) . . (62679) 

<223> n is a, c, g, or t 
<220>

<221> misc_feature

<222> (62681) . . (62681) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature 

<222> (62685).. (62686) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62693) . . (62694) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62696) . . (62696) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62700) . . (62700) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62703) . . (62703) 

<223> n is a, c, g, or t 

<220> 

<221> misc_feature

<222> (62724).. (62724) 

<223> n is a, c, g, or t 



aagigttgca caattccccc aacctccata catacggcag ctcttctaga cacaggtttt 60

cccaggtcaa atgcggggac cccagccata tctcccaccc tgagaaattt tggagtttca 120

gggagctcag aagctctgca gaggccaccc tctctgaggg gattcttctt agacctccat 180

ccagaggcaa atgttgacct gtccatgctg aaaccctcag gccttcctgg gtcatcttct 240

cccacccgct ccttgatgac agggagcagg agcactaaag ccacaccaga aatggattca 300

ggactgacag gagccacctt gtcacctaag acatctacag gtgcaatcgt ggtgacagaa 360

catactctgc cctttacttc cccagataag accttggcca gtcctacatc ttcggttgtg 420

ggaagaacca cccagtcttt gggggtgatg tcctctgctc tccctgagtc aacctctaga 480

ggaatgacac actccgagca aagaaccagc ccatcgctga gtccccaggt caatggaact 540

ccctctagga actaccctgc tacaagcatg gtttcaggat tgagttcccc aaggaccagg 600

accagttcca cagaaggaaa ttttaccaaa gaagcatcta catacacact cactgtagag 660

accacaagtg gcccagtcac tgagaagtac acagtcccca ctgagacctc aacaactgaa 720

ggtgacagca cagagacccc ctgggacaca agatatattc ctgtaaaaat cacatctcca 780

gagagtccta gcactctcag cctcccaagt gtctctggag tcaaaaccac attttcttca
tctactcctt ccactcatct atttactagt ggagaagaaa cagaggaaac ttcgaatcca 
tctgtgtctc aacctgagac ttctgtttcc agagtaagga ccaccttggc cagcacctct 
gtccctaccc cagtattccc caccatggac acctggccta cacgttcagc tcagttctct 
tcatcccacc tagtgagtga gctcagagct acgagcagta cctcagttac aaactcaact 
ggttcagctc ttcctaaaat atctcacctc actgggacgg caacaatgtc acagaccaat 
agagacacgt ttaatgactc tgctgcaccc caaagcacaa cttggccaga gactagtccc 
agattcaaga cagggttacc ttcagcaaca accactgttt caacctctgc cacttctctc 
tctgctactg taatggtctc taaattcact tctccagcaa ctagttccat ggaagcaact 
tctatcaggg aaccatcaac aaccatcctc acaacagaga ccacgaatgg cccaggctct 
atggctgtgg cttctaccaa catcccaatt ggaaagggct acattactga aggaagattg 
gacacaagcc atctgcccat tggaaccaca gcttcctctg agacatctat ggattttacc 
atggccaaag aaagtgtctc aatgtcagta tctccatctc agtccatgga tgctgctggc 
tcaagcactc caggaaggac aagccaattc gttgacacat tttctgatga tgtctatcat 
ttaacatcca gagaaattac aatacctaga gatggaacaa gctcagctct gactccacaa 
atgactgcaa ctcaccctcc atctcctgat cctggctctg ctagaagcac ctggcttggc 
atcttgtcct catctccttc ttctcctact cccaaagtca caatgagctc cacattttca 
actcagagag tcaccacaag catgataatg gacacagttg aaactagtcg gtggaacatg 
cccaacttac cttccacgac ttccctgaca ccaagtaata ttccaacaag tggtgccata 
ggaaaaagca ccctggttcc cttggacact ccatctccag ccacatcatt ggaggcatca 
gaagggggac ttccaaccct cagcacctac cctgaatcaa caaacacacc cagcatccac 
ctcggagcac acgctagttc agaaagtcca agcaccatca aacttaccat ggcttcagta 
gtaaaacctg gctcttacac acctctcacc ttcccctcaa tagagaccca cattcatgta 
tcaacagcca gaatggctta ctcttctggg tcttcacctg agatgacagc tcctggagag 
actaacactg gtagtacctg ggaccccacc acctacatca ccactacgga tcctaaggat 
acaagttcag ctcaggtctc tacaccccac tcagtgagga cactcagaac cacagaaaac 
catccaaaga cagagtccgc caccccagct gcttactctg gaagtcctaa aatctcaagt 
tcacccaatc tcaccagtcc ggccacaaaa gcatggacca tcacagacac aactgaacac 
tccactcaat tacattacac aaaattggca gaaaaatcat ctggatttga gacacagtca 
gctccaggac ctgtctctgt agtaatccct acctccccta ccattggaag cagcacattg 
gaactaactt ctgatgtccc aggggaaccc ctggtccttg ctcccagtga gcagaccaca 
atcactctcc ccatggcaac atggctgagt accagtttga cagaggaaat ggcttcaaca 4620

gaccttgata tttcaagtcc aagttcaccc atgagtacat ttgctatttt tccacctatg 4680 

tccacacctt ctcatgaact ttcaaagtca gaggcagata ccagtgccat tagaaataca 4740 

gattcaacaa cgttggatca gcacctagga atcaggagtt tgggcagaac tggggactta 4800 

acaactgttc ctatcacccc actgacaacc acgtggacca gtgtgattga acactcaaca 4860 

caagcacagg acaccctttc tgcaacgatg agtcctactc acgtgacaca gtcactcaaa 4920 

gatcaaacat ctataccagc ctcagcatcc ccttcccatc ttactgaagt ctaccctgag 4980 

ctcgggacac aagggagaag ctcctctgag gcaaccactt tttggaaacc atctacagac 5040 

acactgtcca gagagattga gactggccca acaaacattc aatccactcc acccatggac 5100 

aacacaacaa cagggagcag tagtagtgga gtcaccctgg gcatagccca ccttcccata 5160 

ggaacatcct ccccagctga gacatccaca aacatggcac tggaaagaag aagttctaca 5220 

gccactgtct ctatggctgg gacaatggga ctccttgtta ctagtgctcc aggaagaagc 5280 

atcagccagt cattaggaag agtttcctct gtcctttctg agtcaactac tgaaggagtc 5340 

acagattcta gtaagggaag cagcccaagg ctgaacacac agggaaatac agctctctcc 5400 

tcctctcttg aacccagcta tgctgaagga agccagatga gcacaagcat ccctctaacc 5460 

tcatctccta caactcctga tgtggaattc atagggggca gcacattttg gaccaaggag 5520 

gtcaccacag ttatgacctc agacatctcc aagtcttcag caaggacaga gtccagctca 5580 

gctaccctta tgtccacagc tttgggaagc actgaaaata caggaaaaga aaaactcaga 5640 

actgcctcta tggatcttcc atctccaact ccatcaatgg aggtgacafcc atggatttct 5700 

ctcactctca gtaatgcccc caataccaca gattcacttg acctcagcca tggggtgcac 5760 

accagctctg cagggacttt ggccactgac aggtcattga atactggtgt cactagagcc 5820 

tccagattgg aaaacggctc tgatacctct tctaagtccc tgtctatggg aaacagcact 5880 

cacacttcca tgactgacac agagaagagt gaagtgtctt cttcaatcca tccccgacct 5940 

gagacctcag ctcctggagc agagaccact ttgacttcca ctcctggaaa cagggccata 6000 

agcttaacat tgcctttttc atccattcca gtggaagaag tcatttctac aggcataacc 6060 

tcaggaccag acatcaactc agcacccatg acacattctc ccatcacccc accaacaatt 6120 

gtatggacca gtacaggcac aattgaacag tccactcaac cactacatgc agtttcttca 6180 

gaaaaagttt ctgtgcagac acagtcaact ccatatgtca actctgtggc agtgtctgct 6240 

tcccctaccc atgagaattc agtctcttct ggaagcagca catcctctcc atattcctca 6300 

gcctcacttg aatccttgga ttccacaatc agtaggagga atgcaatcac ttcctggcta 6360

tgggacctca ctacatctct ccccactaca acttggccaa gtactagttt atctgaggca 6420 

ctgtcctcag gccattctgg ggtttcaaac ccaagttcaa ctacgactga atttccactc 6480 
ttttcagctg catccacatc tgctgctaag caaagaaatc cagaaacaga gacccatggt
ccccagaata cagccgcgag tactttgaac actgatgcat cctcggtcac aggtctttct 
gagactcctg tgggggcaag tatcagctct gaagtccctc ttccaatggc cataacttct 
agatcagatg tttctggcct tacatctgag agtactgcta acccgagttt aggcacagcc 
tcttcagcag ggaccaaatt aactaggaca atatccctgc ccacttcaga gtctttggtt 
tcctttagaa tgaacaagga tccatggaca gtgtcaatcc ctttggggtc ccatccaact 
actaatacag aaacaagcat cccagtaaac agcgcaggtc cacctggctt gtccacagta 
gcatcagatg taattgacac accttcagat ggggctgaga gtattcccac tgtctccttt 
tccccctccc ctgatactga agtgacaact atctcacatt tcccagaaaa gacaactcat 
tcatttagaa ccatttcatc tctcactcat gagttgactt caagagtgac acctattcct 
ggggattgga tgagttcagc tatgtctaca aagcccacag gagccagtcc ctccattaca 
ctgggagaga gaaggacaat cacctctgct gctccaacca cttcccccat agttctcact 
gctagtttca cagagaccag cacagtttca ctggataatg aaactacagt aaaaacctca 
gatatccttg acgcacggaa aacaaatgag ctcccctcag atagcagttc ttcttctgat 
ctgatcaaca cctccatagc ttcttcaact atggatgtca ctaaaacagc ctccatcagt 
cccactagca tctcaggaat gacagcaagt tcctccccat ctctcttctc ttcagataga 
ccccaggttc ccacatctac aacagagaca aatacagcca cctctccatc tgtttccagt 
aacacctatt ctcttgatgg gggctccaat gtgggtggca ctccatccac tttaccaccc 
tttacaatca cccaccctgt cgagacaagc tcggccctat tagcctggtc tagaccagta 
agaactttca gcaccatggt cagcactgac actgcctccg gagaaaatcc tacctctagc 
aattctgtgg tgacttctgt tccagcacca ggtacatggg ccagtgtagg cagtactact 
gacttacctg ccatgggctt tctcaagaca agtcctgcag gagaggcaca ctcacttcta 
gcatcaacta ttgaaccagc cactgccttc actccccatc tctcagcagc agtggtcact 
ggatccagtg ctacatcaga agccagtctt ctcactacga gtgaaagcaa agccattcat 
tcttcaccac agaccccaac tacacccacc tctggagcaa actgggaaac ttcagctact 
cctgagagcc ttttggtagt cactgagact tcagacacaa cacttacctc aaagattttg 
gtcacagata ccatcttgtt ttcaactgtg tccacgccac cttctaaatt tccaagtacg 
gggactctgt ctggagcttc cttccctact ttactcccgg acactccagc catccctctc 
actgccactg agccaacaag ttcattagct acatcctttg attccacccc actggtgact 
atagcttcgg atagtcttgg cadagtccca gagactaccc tgaccatgtc agagacctca 
aatggtgatg cactggttct taagacagta agtaacccag ataggagcat ccctggaatc 
actatccaag gagtaacaga aagtccactc catccttctt ccacttcccc ctctaagatt 8400

gttgctccac ggaatacaac ctatgaaggt tcgatcacag tggcactttc tactttgcct 8460 

gcgggaacta ctggttccct tgtattcagt cagagttctg aaaactcaga gacaacggct 8520 

ttggtagact catcagctgg gcttgagagg gcatctgtga tgccactaac cacaggaagc 8580 

cagggtatgg ctagctctgg aggaatcaga agtgggtcca ctcactcaac tggaaccaaa 8640 

acattttctt ctctccctct gaccatgaac ccaggtgagg ttacagccat gtctgaaatc 8700 

accacgaaca gactgacagc tactcaatca acagcaccca aagggatacc tgtgaagccc 8760 

accagtgctg agtcaggcct cctaacacct gtctctgcct cctcaagccc atcaaaggcc 8820 

tttgcctcac tgactacagc tcccccatca acttggggga tcccacagtc taccttgaca 8880 

tttgagtttt ctgaggtccc aagtttggat actaagtccg cttctttacc aactcctgga 8940 

cagtccctga acaccattcc agactcagat gcaagcacag catcttcctc actgtccaag 9000 

tctccagaaa aaaacccaag ggcaaggatg atgacttcca caaaggccat aagtgcaagc 9060 

tcatttcaat caacaggttt tactgaaacc cctgagggat ctgcctcccc ttctatggca 9120 

gggcatgaac ccagagtccc cacttcagga acaggggacc ctagatatgc ctcagagagc 9180 

atgtcttatc cagacccaag caaggcatca tcagctatga catcgacctc tcttgcatca 9240 

aaactcacaa ctctcttcag cacaggtcaa gcagcaaggt ctggttctag ttcctctccc 9300 

ataagcctat ccactgagaa agaaacaagc ttcctttccc ccactgcatc cacctccaga 9360 

aagacttcac tatttcttgg gccttccatg gcaaggcagc ccaacatatt ggtgcatctt 9420 

cagacttcag ctctgacact ttctccaaca tccactctaa atatgtccca ggaggagcct 9480 

cctgagttaa cctcaagcca gaccattgca gaagaagagg gaacaacagc tgaaacacag 9540 

acgttaacct tcacaccatc tgagacccca acatccttgt tacctgtctc ttctcccaca 9600 

gaacccacag ccagaagaaa gagttctcca gaaacatggg caagctctat ttcagttcct 9660 

gccaagacct ccttggttga aacaactgat ggaacgctag tgaccaccat aaagatgtca 9720 

agccaggcag cacaaggaaa ttccacgtgg cctgccccag cagaggagac ggggaccagt 9780 

ccagcaggca catccccagg aagcccagaa gtgtctacca ctctcaaaat catgagctcc 9840 

aaggaaccca gcatcagccc agagatcagg tccactgtgc gaaattctcc ttggaagact 9900 

ccagaaacaa ctgttcccat ggagaccaca gtggaaccag tcacccttca gtccacagcc 9960 

ctaggaagtg gcagcaccag catctctcac ctgcccacag gaaccacatc accaaccaag 10020 

tcaccaacag aaaatatgtt ggctacagaa agggtctccc tctccccatc cccacctgag 10080 

gcttggacca acctttattc tggaactcca ggagggacca ggcagtcact ggccacaatg 10140 

tcctctgtct ccctagagtc accaactgct agaagcatca cagggactgg tcagcaaagc 10200 

agtccagaac tggtttcaaa gacaactgga atggaattct ctatgtggca tggctctact 10260 
tctgtttgat 11400

tctctgtcca ctcttgactg cgacactqgg agatctctgt catcagccac agccactacc 11460

tcagctcctc aaoggaccac aactccccag gaactcactt tggaaaccat gatcagccca 11520

gctacctcac agttgccctt ctctataggg cacattacaa gtgcagtcac accagctgca 11580

atggcaagga gctctggagt tactttttca agaccagatc ccacaagcaa aaaggcagag 11640

cagacttcca ctcagcttcc caccaccact tctgcacatc cagggcaggt gcccagatca 11700

gcagcaacaa ctctggatgt gatcccacac acagcaaaaa ctccagatgc aacttttcag 11760




ctyaCayuud 


^a^9 9i/*9nan 
LaC-clclUciydy 


fi** a a n a rt r*^'a 
yCadyayLLa 


Ccl^C i.y civ. 


^ LyyaaLyay 




aaagaaaaat caaccccaag tgcaccttgg atcactgaga tgatgaattc tgtctcagaa 11880

gataccatca aggaggttac cagctcctcc agtgtattaa aggaccctga atacgctgga 11940

cataaacttg gaatctggga cgacttcatc cccaagtttg gaaaagcagc ccatatgaga 12000

gagttgcccc ttctgagtcc accacaggac aaagaggcaa ttcacccttc tacaaacaca 12060

gtagagacca caggctgggt cacaagttcc gaacatgctt ctcattccac tatcccagcc 12120

cacatctcca gtggttacaa cctccaccag ggaacaagca 12180

atagtttcta tgtcaacaac cacatggcca gagtctacaa gggctagaac agagcctaat 12240

tccttcttga ctattgaact gagggacgtc agcccttaca tggacaccag ctcaaccaca 12300

caaacaagta ttatctcttc cccaggttcc actgcgatca ccaaggggcc tagaacagaa 12360

attacctcct ctaagagaat atccagctca ttccttgccc agtctatgag gtcgtcagac 12420

agcccctcag aagccatcac caggctgtct aactttcctg ccatgacaga atctggagga 12480

atgatccttg ctatgcaaac aagtccacct ggcgctacat cactaagtgc acctactttg 12540

gatacatcag ccacagcctc ctggacaggg actccactgg ctacgactca gagatttaca 12600

tactcagaga agaccactct ctttagcaaa ggtcctgagg atacatcaca gccaagccct 12660

ccctctgtgg aagaaaccag ctcttcctct tccctggtac ctatccatgc tacaacctcg 12720

ccttccaata ttttgttgac atcacaaggg cacagtccct cctctactcc acctgtgacc 12780

tcagttttct tgtctgagac ctctggcctg gggaagacca cagacatgtc gaggataagc 12840

ttggaacctg gcacaagttt acctcccaat ttgagcagta cagcaggtga ggcgttatcc 12900

acttatgaag cctccagaga tacaaaggca attcatcatt ctgcagacac agcagtgacg 12960

aatatggagg caaccagttc tgaatattct cctatcccag gccatacaaa gccatccaaa 13020

gccacatctc cattggttac ctcccacatc atgggggaca tcacttcttc cacatcagta 13080

tttggctcct ccgagaccac agagattgag acagtgtcct ctgtgaacca gggacttcag 13140

gagagaagca catcccaggt ggccagctct gctacagaga caagcactgt cattacccat 13200

gtgtctagtg gtgatgctac tactcatgtc accaagacac aagccacttt ctctagcgga 13260

acatccatct caagccctca tcagtttata acttctacca acacawCtac agatgtgagc 13320

accaacccct ccacctctct gataatgaca gaatcttcag gagtgaccat caccacccaa 13380

acaggtccta ctggagctgc aacacagggt ccatatctct tggacacatc aaccatgcct 13440

tacttgacag agactccatt agctgtgact ccagatttta tgcaatcaga gaagaccact 13500


ctcataagca 


aaggtcccaa 


ggatgtgacc 


tggacaagcc 


ctccctctgt 


ggcagaaacc 


13560 


agctatccct 


cttccctgac 


acctttcttg 


gtcacaacca 


tacctcctgc 


cacttccacg 


13620 


ttacaagggc 


aacatacatc 


ctctcctgtt 


tctgcgactt 


cagttcttac 


ctctggactg 


13680 


gtgaagacca 


cagatatgtt 


gaacacaagc 


atggaacctg 


tgaccaattc 


acctcaaaat 


13740 


ttgaacaatc 


catcaaatga 


gatactggcc 


actttggcag 


ccaccacaga 


tatagagact 


13800 


attcatcctt 


ccataaacaa 


agcagtgacc 


aatatgggga 


ctgccagttc 


agcacatgta 


13860 


ctgcattcca 


ctctcccagt 


cagctcagaa 


ccatctacag 


ccacatctcc 


aatggttcct 


13920 


gcctccagca 


tgggggacgc 


tcttgcttct 


atatcaatac 


ctggttctga 


gaccacagac 


13980 


attgagggag 


agccaacatc 


ctccctgact 


gctggacgaa aagagaacag 
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gagatgaact caactacaga gtcaaacatc atcctctcca atgtgtctgt gggggctatt 
14100
actgaagcca caaaaatgga agtcccctct tttgatgcaa cattcatacc aactcctgct 14160
cagtcaacaa agttcccaga tattttctca gtagccagca gtagactttc aaactctcct 14220
cccatgacaa tatctaccca catgaccacc acccagacag ggtcttctgg agctacatca 14280
aagattccac ttgccttaga cacatcaacc ttggaaacct cagcagggac tccatcagtg 14340
gtgactgagg ggtttgccca ctcaaaaata accactgcaa tgaacaatga tgtcaaggac 14400
gtgtcacaga caaaccctcc ctttcaggat gaagccagct ctccctcttc tcaagcacct 14460
gtccttgtca caaccttacc ttcttctgtt gctttcacac cgcaatggca cagtacctcc 14520
tctcctgttt ctatgtcctc agttcttact tcttcactgg taaagaccgc aggcaaggtg 14580
gatacaagct tagaaacagt gaccagttca cctcaaagta tgagcaacac tttggatgac 14640
atatcggtca cttcagcagc caccacagat atagagacaa cgcatccttc cataaacaca 14700
gtagttacca atgtggggac caccggttca gcatttgaat cacattctac tgtctcagct 14760
tacccagagc catctaaagt cacatctcca aatgttacca cctccaccat ggaagacacc 14820
acaatttccc gatcaatacc taaatcctct aagactacaa gaactgagac tgagacaact 14880
tcctccctga ctcctaaact gagggagacc agcatctccc aggagatcac ctcgtccaca 14940
gagacaagca ctgttcctta caaagagctc actggtgcca ctaccgaggt atccaggaca 15000
gatgtcactt cctctagcag tacatccttc cctggccctg atcagtccac agtgtcacta 15060
gacatctcca cagaaaccaa caccaggctg tctacctccc caataatgac agaatctgca 15120
gaa. taacca tcaccaccca aacaggtcct catggggcta catcacagga tacttttacc 15180
atggacccat caaatacaac cccccaggca gggatccact cagctatgac tcatggattt 15240
tcacaattgg atgtgaccac tcttatgagc agaattccac aggatgtatc atggacaagt 15300
cctccctctg tggataaaac cagctccccc tcttcctttc tgtcctcacc tgcaatgacc 15360
acaccttccc tgatttcttc taccttacca gaggataagc tctcctctcc tatgacttca 15420
cttctcacct ctggcctagt gaagattaca gacatattac gtacacgctt ggaacctgtg 15480
accagctcac ttccaaattt cagcagcacc tcagataaga tactggccac ttctaaagac 15540
agtaaagaca caaaggaaat ttttccttct ataaacacag aagagaccaa tgtgaaagcc 15600
aacaactctg gacatgaatc ccattcccct gcactggctg actcagagac acccaaagcc 15660
acaactcaaa tggttatcac caccactgtg ggagatccag ctccttccac atcaatgcca 15720
gtgcatggtt cctctgagac tacaaacatt aagagagagc caacatattt cttgactcct 15780
agactgagag agaccagtac ctctcaggag tccagctttc ccacggacac aagttttcta 15840
ctttccaaag tccccactgg tactattact gaggtctcca gtacaggggt caactcttct 15900



Page 143 



WO 2004/045553 PCT/US2003/037041



agcaaaattt 


ccaccccaga 
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ccatgataag tccacagtgc cacctgacac cttcacagga 


15960 


gagatcccca gggtcttcac ctcctctatt aagacaaaat ctgcagaaat gacgatcacc 16020
acccaagcaa gtcctcctga gtctgcatcg cacagtaccc ttcccttgga cacatcaacc 16080
acactttccc agggagggac tcattcaact gtgactcagg gattcccata ctcagaggtg 16140
accactctca tgggcatggg tcctgggaat gtgtcatgga tgacaactcc ccctgtggaa 16200
gaaaccagct ctgtgtcttc cctgatgtct tcacctgcca tgacatcccc ttctcctgtt 16260
tcctccacat caccacagag catcccctcc tctcctcttc ctgtgactgc acttcctact 16320
tctgttctgg tgacaaccac agatgtgttg ggcacaacaa gcccagagtc tgtaaccagt 16380
tcacctccaa atttgagcag catcactcat gagagaccgg ccacttacaa agacactgca 16440
cacacagaag ccgccatgca tcattccaca aacaccgcag tgaccaatgt agggacttcc 16500
gggtctggac ataaatcaca atcctctgtc ctagctgact cagagacatc gaaagccaca 16560
cctctgatga gtaccacctc caccctgggg gacacaagtg tttccacatc aactcctaat 16620
atctctcaga ctaaccaaat tcaaacagag ccaacagcat ccctgagccc tagactgagg 16680
gagagcagca cgtctgagaa gaccagctca acaacagaga caaatactgc cttttcttat 16740
gtgcccacag gtgctattac tcaggcctcc agaacagaaa tctcctctag cagaacatcc 16800
atctcagacc ttgatcggcc cacaatagca cccgacatct ccacaggaat gatcaccagg 16860
ctcttcacct cccccatcat gacaaaatct gcagaaatga ccgtcaccac tcaaacaact 16920
actcctgggg ctacatcaca gggtatcctt ccttgggaca catcaaccac acttttccag 16980
ggagggactc attcaaccgt gtctcaggga ttcccacact cagagataac cactcttcgg 17040
agcagaaccc ctggagatgt Qtcatggatg acaactcccc ctgtggaaga aaccagctct 17100
gggttttccc tgatgtcacc ttccatgaca tccccttctc ctgtttcctc cacatcacca 17160
gagagcatcc cctcctctcc tctccctgtg actgcacttc ttacttctgt tctggtgaca 17220
accaccaatg tattgggcac aacaagccca gagaccgtaa cgagttcacc tccaaattta 17280
agcagcccca cacaggagag actgaccact tacaaagaca ctgcgcacac agaagccatg 17340
catgcttcca tgcatacaaa cactgcagtg gccaacgtcg ggacctccat ttctggacat 17400
gaatcacaat cttctgtccc agctgattca cacacatcca aagccacatc tccaatgggt 17460
atcaccttcg ccatggggga tacaagtgtt tctacatcaa ctcctgcctt ctttgagact 17520
agaattcaga ctgaatcaac atcctctttg attcctggat taagggacac caggacgtct 17580
gaggagatca acactgtgac agagaccagc actgtccttt cagaagtgcc cactactact 17640
actactgagg tctccaggac agaagttatc acttccagca gaacaaccat ctcagggcct 17700
gatcattcca aaatgtcacc ctacatctcc acagaaacca tcaccaggct ctccactttt 17760
ccttttgtaa caggatccac agaaatggcc atcaccaacc aaacaggtcc
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atctcacagg ctacccttac cctggacaca tcaagcacag cttcctggga agggactcac 17880 

tcacctgtga ctcagagatt tccacactca gaggagacca ctactatgag cagaagtact 17940 

aagggcgtgt catggcaaag ccctccctct gtggaagaaa ccagttctcc ttcttcccca 18000 

gtgcctttac ctgcaataac ctcacattca tctctttatt ccgcagtatc aggaagtagc 18060 

cccacttctg ctctccctgt gacttccctt ctcacctctg gcaggaggaa gaccatagac 18120 

atgttggaca cacactcaga acttgtgacc agctccttac caagtgcaag tagcttctca 18180 

ggtgagatac tcacttctga agcctccaca aatacagaga caattcactt ttcagagaac 18240 

acagcagaaa ccaatatggg gaccaccaat tctatgcata aactacattc ctctgtctca 18300 

atccactccc agccatccgg acacacacct ccaaaggtta ctggatctat gatggaggac 18360 

gctattgttt ccacatcaac acctggttct cctgagacta aaaatgttga cagagactca 18420 

acatcccctc tgactcctga actgaaagag gacagcaccg ccctggtgat gaactcaact 18480 

acagagtcaa acactgtttt ctccagtgtg tccctggatg ctgctactga ggtctccagg 18540 

gcagaagtca cctactatga tcctacattc atgccagctt ctgctcagtc aacaaagtcc 18600 

ccagacattt cacctgaagc cagcagcagt cattctaact ctcctccctt gacaatatct 18660 

acacacaaga ccatcgccac acaaacaggt ccttctgggg tgacatctct tggccaactg 18720 

accctggaca catcaaccat agccacctca gcaggaactc catcagccag aactcaggat 18780 

tttgtagatt cagaaacaac cagtgtcatg aacaatgatc tcaatgatgt gttgaagaca 18840 

agccctttct ctgcagaaga agccaactct ctctcttctc aggcacctct ccttgtgaca 18900 

acctcacctt ctcctgtaac ttccacattg caagagcaca gtacctcctc tcttgtttct 18960 

gtgacctcag tacccacccc tacactggcg aagatcacag acatggacac aaacttagaa 19020 

cctgtgactc gttcacctca aaatttaagg aacaccttgg ccacttcaga agccaccaca 19080 

gatacacaca caatgcatcc ttctataaac acagcaatgg ccaatgtggg gaccaccagt 19140 

tcaccaaatg aattctattt tactgtctca cctgactcag acccatataa agccacatcc 19200 

gcagtagtta tcacttccac ctcgggggac tcaatagttt ccacatcaat gcctagatcc 19260 

tctgcgatga aaaagattga gtctgagaca actttctccc tgatatttag actgagggag 19320 

actagcacct cccagaaaat tggctcatcc tcagacacaa gcacggtctt tgacaaagca 19380

ttcactgctg ctactactga ggtctccaga acagaactca cctcctctag cagaacatcc 19440 

atccaaggca ctgasuiagcc cacaatgtca ccggacacct ccacaagatc tgtcaccatg 19500 

ctttctactt ttgctggcct gacaaaatcc gaagaaagga ccattgccac ccaaacaggt 19560

cctcataggg cgacatcaca gggtaccctt acctgggaca catcaatcac aacctcacag 19620 

gcagggaccc actcagctat gactcatgga ttttcacaat tagatttgtc cactcttacg 19680 
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agtagagttc ctgagtacat atcagggaca agcccaccct ctgtggaaaa aaccagctct 19740 

tcctcttccc ttctgtcttt accagcaata acctcaccgt cccctgtacc tactacatta 19800 

ccagaaagta ggccgtcttc tcctgttcat ctgacttcac tccccacctc tggcctagtg 19860 

aagaccacag atatgctggc atctgtggcc agtttacctc caaacttggg cagcacctca 19920 

cataagatac cgactacttc agaagacatt aaagatacag agaaaatgta tccttccaca 19980 

aacatagcag taaccaatgt ggggaccacc acttctgaaa aggaatctta ttcgtctgtc 20040 

ccagcctact cagaaccacc caaagtcacc tctccaatgg ttacctcttt caacataagg 20100 

gacaccattg tttccacatc catgcctggc tcctctgaga ttacaaggat tgagatggag 20160 

tcaacattct ccgtggctca tgggctgaag ggaaccagca cctcccagga ccccatcgta 20220 

tccacagaga aaagtgctgt ccttcacaag ttgaccactg gtgctactga gacctctagg 20280 

acagaagttg cctcttctag aagaacatcc attccaggcc ctgatcattc cacagagtca 20340 

ccagacatct ccactgaagt gatccccagc ctgcctatct cccttggcat tacagaatct 20400 

tcaaatatga ccatcatcac tcgaacaggt cctcctcttg gctctacatc acagggcaca 20460 

tttaccttgg acacaccaac tacatcctcc agggcaggaa cacactcgat ggcgactcag 20520 

gaatttccac actcagaaat gaccactgtc atgaacaagg accctgagat tctatcatgg 20580 

acaatccctc cttctataga gaaaaccagc ttctcctctt ccctgatgcc ttcaccagcc 20640 

atgacttcjac ctcctgtttc ctcaacatta ccaaagacca ttcacaccac tccttctcct 20700 

atgacctcac tgctcacccc tagcctagtg atgaccacag acacattggg cacaagccca 20760 

gaacctacaa ccagttcacc tccaaatttg agcagtacct cacatgtgat actgacaaca 20820 

gatgaagaca ccacagctat agaagccatg catccttcca caagcacagc agcgactaat 20880 

gtggaaacca cctgttctgg acatgggtca caatcctctg tcctaactga ctcagaaaaa 20940 

accaaggcca cagctccaat ggataccacc tccaccatgg ggcatacaac tgtttccaca 21000 

tcaatgtctg tttcctctga gactacaaaa attaagagag agtcaacata ttccttgact 21060 

cctggactga gagagaccag catttcccaa aatgccagct tttccactga cacaagtatt 21120 

gttctttcag aagtccccac tggtactact gctgaggtct ccaggacaga agtcacctcc 21180 

tctggtagaa catccatccc tggcccttct cagtccacag ttttgccaga aatatccaca 21240 

agaacaatga caaggctctt tgcctcgccc accatgacag aatcagcaga aatgaccatc 21300 

cccactcaaa caggtccttc tgggtctacc tcacaggata cccttacctt ggacacatcc 21360 

accacaaagt cccaggcaaa gactcattca actttgactc agagatttcc acactcagag 21420 

atgaccactc tcatgagcag aggtcctgga gatatgtcat ggcaaagctc tccctctctg 21480 

gaaaatccca gctctctccc ttccctgctg tctttacctg ccacaacctc acctcctccc 21540 

atttcctcca cattaccagt gactatctcc tcctctcctc ttcctgtgac ttcacttctc 21600 
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acctctagcc cggtaacgac cacagacatg ttacacacaa gcccagaact tgtaaccagt 21660 

tcacctccaa agctgagcca cacttcagat gagagactga ccactggcaa ggacaccaca 21720 

aatacagaag ctgtgcatcc ttccacaaac acagcagcgt ccaatgtgga gattcccagc 21780 

tttggacatg aatccccttc ctctgcctta gctgactcag agacatccaa agccacatca 21840 

ccaatgttta ttacctccac ccaggaggat acaactgttg ccatatrcaac ccctcacttc 21900 

ttggagacta gcagaattca gaaagagtca atttcctccc tgagccctaa attgagggag 21960 

acaggcagtt ctgtggagac aagctcagcc atagagacaa gtgctgtcct ttctgaagtg 22020 

tccattggtg ctactactga gatctccagg acagaagtca cctcctctag cagaacatcc 22080 

atctctggtt ctgctgagtc cacaatgttg ccagaaatat ccaccacaag aaaaatcatt 22140 

aagttcccta cttcccccat cctggcagaa tcatcagaaa tgaccatcaa gacccaaaca 22200 

agtcctcctg ggtctacatc agagagtacc tttacattag acacatcaac cactccctcc 22260 

ttggtaataa cccattcgac tatgactcag agattgccac actcagagat aaccactctt 22320 

gtgagtagag gtgctgggga tgtgccacgg cccagctctc tccctgtgga agaaacaagc 22380 

cctccatctt cccagctgtc tttatctgcc atgatctcac cttctcctgt ttcttccaca 22440 

ttaccagcaa gtagccactc ctcttctgct tctgtgactt cacctctcac accaggccaa 22500 

gtgaagacta ctgaggtgtt ggacgcaagt gcagaacctg aaaccagttc acctccaagt 22560 

ttgagcagca cctcagttga aatactggcc acctctgaag tcaccacaga tacggagaaa 22620 

attcatcctt tcccaaacac ggcagtaacc aaagttggaa cttccagttc tggacatgaa 22680 

tccccttcct ctgtcctacc tgactcagag acaaccaaag ccacatcggc aatgggtacc 22740 

atctccatta tgggggatac aagtgtttct acattaactc ctgccttatc taacactagg 22800 

aaaattcagt cagagccagc ttcctcactg accaccagat tgagggagac cagcacctct 22860 

gaagagacca gcttagccac agaagcaaac actgttcttt ctaaagtgtc cactggtgct 22920 

actactgagg tctccaggac agaagccatc tcctttagca gaacatccat gtcaggccct 22980 

gagcagtcca caatgtcaca agacatctcc ataggaacca tccccaggat ttctgcctcc 23040 

tctgtcctga cagaatctgc aaaaatgacc atcacaaccc aaacaggtcc ttcggagtct 23100 

acactagaaa gtacccttaa tttgaacaca gcaaccacac cctcttgggt ggaaacccac 23160 

tctatagtaa ttcagggatt tccacaccca gagatgacca cttccatggg cagaggtcct 23220 

ggaggtgtgt catggcctag ccctcccttt gtgaaagaaa ccagccctcc atcctccccg 23280

ctgtctttac ctgccgtgac ctcacctcat cctgtttcca ccacattcct agcacatatc 23340 

cccccctctc cccttcctgt gacttcactt ctcacctctg gcccggcgac aaccacagat 23400 

atcttgggta caagcacaga acctggaacc agttcatctt caagtttgag caccacctcc 23460 
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catgagagac tgaccactta caaagacact gcacatacag aagccgtgca tccttccaca 23520 

aacacaggag ggaccaatgt ggcaaccacc agctctggat ataaatcaca gtcctctgtc 23580 

ctagctgact catctccaat gtgtaccacc tccaccatgg gggatacaag tgttctcaca 23640 

tcaactcctg ccttccttga gactaggagg attcagacag agctagcttc ctccctgacc 23700 

cctggattga gggagtccag tggctctgaa gggaccagct caggcaccaa gatgagcact 23760 

gtcctctcta aagtgcccac tggtgctact actgagatct ccaaggaaga cgtcacctcc 23820 

atcccaggtc ccgctcaatc cacaatatca ccagacatct ccacaagaac cgtcagctgg 23880 

ttctctacat cccctgtcat gacagaatca gcagaaataa ccatgaacac ccatacaagt 23940 

cctttagggg ccacaacaca aggcaccagt actttggcca cgtcaagcac aacctctttg 24000 

acaatgacac actcaactat atctcaagga ttttcacact cacagatgag cactcttatg 24060 

aggaggggtc ctgaggatgt atcatggatg agccctcccc ttctggaaaa aactagacct 24120 

tccttttctc tgatgtcttc accagccaca acttcacctt ctcctgtttc ctccacatta 24180 

ccagagagca tctcttcctc tcctcttcct gtgacttcac tcctcacgtc tggcttggca 24240 

aaaactacag atatgttgca caaaagctca gaacctgtaa ccaactcacc tgcaaatttg 24300 

agcagcacct cagttgaaat actggccacc tctgaagtca ccacagatac agagaaaact 24360 

catccttctt caaacagaac agtgaccgat gtggggacct ccagttctgg acatgaatcc 24420 

acttcctttg tcctagctga ctcacagaca tccaaagtca catctccaat ggttattacc 24480 

tccaccatgg aggatacgag tgtctccaca tcaactcctg gcttttttga gactagcaga 24540 

attcagacag aaccaacatc ctccctgacc cttggactga gaaagaccag cagctctgag 24600 

gggaccagct tagccacaga gatgagcact gtcctttctg gagtgcccac tggtgccact 24660 

gctgaagtct ccaggacaga agtcacctcc tctagcagaa catccatctc aggctttgct 24720 

cagctcacag tgtcaccaga gacttccaca gaaaccatca ccagactccc tacctccagc 24780 

ataatgacag aatcagcaga aatgatgatc aagacacaaa cagatcctcc tgggtctaca 24840 

ccagagagta ctcatactgt ggacatatca acaacaccca actgggtaga aacccactcg 24900 

actgtgactc agagattttc acactcagag atgaccactc ttgtgagcag aagccctggt 24960 

gatatgttat ggcctagtca atcctctgtg gaagaaacca gctctgcctc ttccctgctg 25020 

tctctgcctg ccacgacctc accttctcct gtttcctcta cattagtaga ggatttccct 25080 

tccgcttctc ttcctgtgac ttctcttctc acccctggcc tggtgataac cacagacagg 25140 

atgggcataa gcagagaacc tggaaccagt tccacttcaa atttgagcag cacctcccat 25200 

gagagactga ccactttgga agacactgta gatacagaag acatgcagcc ttccacacac 25260 

acagcagtga ccaacgtgag gacctccatt tctggacatg aatcacaatc ttctgtccta 25320 

tctgactcag agacacccaa agccacatct ccaatgggta ccacctacac catgggggaa 25380 
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acgagtgttt ccatatccac ttctgacttc tttgagacca gcagaattca gatagaacca 25440 
acatcctccc tgacttctgg attgagggag accagcagct ctgagaggat cagctcagcc 25500 
acagagggaa gcactgtcct ttctgaagtg cccagtggtg ctaccactga ggtctccagg 25560 



ccagacatct ctactgaagc gatcaccagg ctttctactt cccccattat gacagaatca 25680 

gcagaaagtg ccatcactat tgagacaggt tctcctgggg ctacatcaga gggtaccctc 25740 

accttggaca cctcaacaac aaccttttgg tcagggaccc actcaactgc atctccagga 25800 

ttttcacact cagagatgac cactcttatg agtagaactc ctggagatgt gccatggccg 25860 

agccttccct ctgtggaaga agccagctct gtctcttcct cactgtcttc acctgccatg 25920 

acctcaactt cttttttctc cgcattacca gagagcatct cctcctctcc tcatcctgtg 25980 

actgcacttc tcacccttgg cccagtgaag accacagaca tgttgcgcac aagctcagaa 26040 

cctgaaacca gttcacctcc aaatttgagc agcacctcag ctgaaatatt agccacgtct 26100 

gaagtcacca aagatagaga gaaaattcat ccctcctcaa acacacctgt agtcaatgta 26160 

gggactgtga tttataaaca tctatcccct tcctctgttt tggctgactt agtgacaaca 26220 

aaacccacat ctccaatggc taccacctcc actctgggga atacaagtgt ttccacatca 26280 

actcctgcct tcccagaaac tatgatgaca cagccaactt cctccctgac ttctggatta 26340 

agggagatca gtacctctca agagaccagc tcagcaacag agagaagtgc ttctctttct 26400 

ggaatgccca ctggtgctac tactaaggtc tccagaacag aagccctctc cttaggcaga 26460 

acatccaccc caggtcctgc tcaatccaca atatcaccag aaatctccac ggaaaccatc 26520 

actagaattt ctactcccct caccacgaca ggatcagcag aaatgaccat cacccccaaa 26580 

acaggtcatt ctggggcatc ctcacaaggt acctttacct tggacacatc aagcagagcc 26640 

tcctggccag gaactcactc agctgcaact cacagatctc cacactcagg gatgaccact 26700 

cctatgagca gaggtcctga ggatgtgtca tggccaagcc gcccatcagt ggaaaaaact 26760 

agccctccat cttccctggt gtctttatct gcagtaacct caccttcgcc actttattcc 26820 

acaccatctg agagtagcca ctcatctcct ctccgggtga cttctctttt cacccctgtc 26880 

atgatgaaga ccacagacat gttggacaca agcttggaac ctgtgaccac ttcacctccc 26940 

agtatgaata tcacctcaga tgagagtctg gccacttcta aagccaccat ggagacagag 27000 

gcaattcagc tttcagaaaa cacagctgtg actcagatgg gcaccatcag cgctagacaa 27060 

gaattctatt cctcttatcc aggcctccca gagccatcca aagtgacatc tccagtggtc 27120 

acctcttcca ccataaaaga cattgtttct acaaccatac ctgcttcctc tgagataaca 27180 

agaattgaga tggagtcaac atccaccctg acccccacac caagggagac cagcacctcc 27240 



acagaagtga tatcctctag gggaacatcc atgtcagggc ctgatcagtt caccatatca 25620
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caggagatcc actcagccac aaagccaagc actgttcctt acaaggcact cactagtgcc 27300 

acgattgagg actccatgac acaagtcatg tcctctagca gaggacctag ccctgatcag 27360 

tccacaatgt cacaagacat atccagtgaa gtgatcacca ggctctctac ctcccccatc 27420 

aaggcagaat ctacagaaat gaccattacc acccaaacag gttctcctgg ggctacatca 27480 

aggggtaccc ttaccttgga cacttcaaca acttttatgt cagggaccca ctcaactgca 27540 

tctcaaggat tttcacactc acagatgacc gctcttatga gtagaactcc tggagatgtg 27600 

ccatggctaa gccatccctc tgtggaagaa gccagctctg cctctttctc actgtcttca 27660 

cctgtcatga cctcatcttc tcccgtttct tccacattac cagacagcat ccactcttct 27720 

tcgcttcctg tgacatcact tctcacctca gggctggtga agaccacaga gctgttgggc 27780 

acaagctcag aacctgaaac cagttcaccc ccaaatttga gcagcacctc agctgaaata 27840 

ctggccacca ctgaagtcac tacagataca gagaaactgg agatgaccaa tgtggtaacc 27900 

tcaggttata cacatgaatc tccttcctct gtcctagctg actcagtgac aacaaaggcc 27960 

acatcttcaa tgggtatcac ctaccccaca ggagatacaa atgttctcac atcaacccct 28020 

gccttctctg acaccagtag gattcaaaca aagtcaaagc tctcactgac tcctgggttg 28080 

atggagacca gcatctctga agagaccagc tctgccacag aaaaaagcac tgtcctttct 28140 

agtgtgccca ctggtgctac tactgaggtc tccaggacag aagccatctc ttctagcaga 28200 

acatccatcc caggccctgc tcaatccaca atgtcatcag acacctccat ggaaaccatc 28260 

actagaattt ctacccccct cacaaggaaa gaatcaacag acatggccat cacccccaaa 28320 

acaggtcctt ctggggctac ctcgcagggt acctttacct tggactcatc aagcacagcc 28380 

tcctggccag gaactcactc agctacaact cagagatttc cacagtcagt ggtgacaact 28440 

cctatgagca gaggtcctga ggatgtgtca tggccaagcc cgctgtctgt ggaaaaaaac 28500 

agccctccat cttccctggt atcttcatct tcagtaacct caccttcgcc actttattcc 28560 

acaccatctg ggagtagcca ctcctctcct gtccctgtca cttctctttt cacctctatc 28620 

atgatgaagg ccacagacat gttggatgca agtttggaac ctgagaccac ttcagctccc 28680 

aatatgaata tcacctcaga tgagagtctg gccacttcta aagccaccac ggagacagag 28740 

gcaattcacg tttttgaaaa tacagcagcg tcccatgtgg aaaccaccag tgctacagag 28800 

gaactctatt cctcttcccc aggcttctca gagccaacaa aagtgatatc tccagtggtc 28860 

acctcttcct ctataagaga caacatggtt tccacaacaa tgcctggctc ctctggcatt 28920 

acaaggattg agatagagtc aatgtcatct ctgacccctg gactgaggga gaccagaacc 28980 

tcccaggaca tcacctcatc cacagagaca agcactgtcc tttacaagat gtcctctggt 29040 

gccactcctg aggtctccag gacagaagtt atgccctcta gcagaacatc cattcctggc 29100 

cctgctcagt ccacaatgtc actagacatc tccgatgaag ttgtcaccag gctgtctacc 29160 




tctcccatca tgacagaatc tgcagaaata accatcacca cccaaacagg ttattctctg 29220 
gctacatccc aggttaccct tcccttgggc acctcaatga cctttttgtc agggacccac 29280 
tcaactatgt ctcaaggact ttcacactca gagatgacca atcttatgag caggggtcct 29340 



acatcattac ctctcacgac ctcactttct cctgtgtcct ccacattact agacagtagc 29460 

ccctcctctc ctcttcctgt gacttcactt atcctcccag gcctggtgaa gactacagaa 29520 

gtgttggata caagctcaga gcctaaaacc agttcatctc caaatttgag cagcacctca 29580 

gttgaaatac cggccacctc tgaaatcatg acagatacag agaaaattca tccttcctca 29640 

aacacagcgg tggccaaagt gaggacctcc agttctgttc atgaatctca ttcctctgtc 29700 

ctagctgact cagaaacaac cataaccata ccttcaatgg gtatcacctc cgctgtggac 29760 

gataccactg ttttcacatc aaatcctgcc ttctctgaga ctaggaggat tccgacagag 29820 

ccaacattct cattgactcc tggattcagg gagactagca cctctgaaga gaccacctca 29880 

atcacagaaa caagtgcagt cctttatgga gtgcccacta gtgctactac tgaagtctcc 29940 

atgacagaaa tcatgtcctc taatagaaca cacatccctg actctgatca gtccacgatg 30000 

tctccagaca tcatcactga agtgatcacc aggctctctt cctcatccat gatgtcagaa 30060 

tcaacacaaa tgaccatcac cacccaaaaa agttctcctg gggctacagc acagagtact 30120 

cttaccttgg ccacaacaac agcccccttg gcaaggaccc actcaactgt tcctcctaga 30180 

tttttacact cagagatgac aactcttatg agtaggagtc ctgaaaatcc atcatggaag 30240 

agctctccct ttgtggaaaa aactagctct tcatcttctc tgttgtcctt acctgtcacg 30300 

acctcacctt ctgtttcttc cacattaccg cagagtatcc cttuctcctc tttttctgtg 30360 

acttcactcc tcaccccagg catggtgaag actacagaca caagcacaga acctggaacc 30420

agtttatctc caaatctgag tggcacctca gttgaaatac tggctgcctc tgaagtcacc 30480 

acagatacag agaaaattca tccttcttca agcatggcag tgaccaatgt gggaaccacc 30540 

agttctggac atgaactata ttcctctgtt tcaatccact cggagccatc caaggctaca 30600 

tacccagtgg gtactccctc ttccatggct gaaacctcta tttccacatc aatgcctgct 30660 

aattttgaga ccacaggatt tgaggctgag ccattttctc atttgacttc tggatttagg 30720 

aagacaaaca tgtccctgga caccagctca gtcacaccaa caaatacacc ttcttctcct 30780 

gggtccactc accttttaca gagttccaag actgatttca cctcttctgc aaaaacatca 30840 

tccccagact ggcctccagc ctcacagtat actgaaattc cagtggacat aatcaccccc 30900 

tttaatgctt ctccatctat tacggagtcc actgggataa cctccttccc agaatccagg 30960 

tttactatgt ctgtaacaga aagtactcat catctgagta cagatttgct gccttcagct 31020 



gaaagtctgt catggacgag ccctcgcttt gtggaaacaa ctagatcttc ctcttctctg 29400



ccactggcac agtgatgcct tctctatcag aggccatgac ttcatttgcc 




aCCaCL99^9 ttccacgagc catctcaggt tcaggtagtc cattctctag gacagagtca 31140
ggccctgggg atgctactct gtccaccatt gcagagagcc tgccttcatc cactcctgtg 31200
CCat XCtCCX cttcaacctt cactaccact gattcttcaa ccatcccagc cctccatgag 31260
clLaaCTxCCX cttcagctac cccatataga gtggacacca gtcttgggac agagagcagc 31320
dCLdCXgclag gacgcttggt tatggtcagt actttggaca cttcaagcca accaggcagg 31380
ECaXCLLCaa cacccatttt ggataccaga atgacagaga gcgttgagct gggaacagtg 31440


acaagtgctt atcaagttcc ttcactctca acacggttga caagaactga tggcattatg 31500
gaacacatca caaaaatacc caatgaagca gcacacagag gtaccataag accagtcaaa 31560
ggccctcaga catccacttc gcctgccagt cctaaaggac tacacacagg agggacaaaa 31620
agaatggaga ccaccaccac agctttgaag accaccacca cagctttgaa g^ccacttcc 31680
agagccacct tgaccaccag tgtctatact cccactttgg gaacactgac tcccctcaat 31740
gcaxcaaggc aaatggccag cacaatcctc acagaaatga tgatcacaac cccatatgtt 31800
ttccctgatg ttccagaaac gacatcctca ttggctacca gcctgggagc agaaaccagc 31860
acagctcttc ccaggacaac cccatctgtt ctcaatagag aatcagagac cacagcctca 31920
ctggtctctc gttctggggc agagagaagt ccggttattc aaactctaga tgtttcttct 31980
agtgagccag atacaacagc ttcatgggtt atccatcctg cagagaccat CGcaactgtt 32040
tccaagacaa cccccaattt tttccacagt gaattagaca ctgtatcttc cacagccacc 32100
agtcatgggg cagacgtcag ctcagccatt ccaacaaata tctcacctag tgaactagat 32160
gcictgaccc cactggtcac tatttcgggg acagatacta gtacaacatt cccaacactg 32220
actaagtccc cacatgaaac agagacaaga accacatggc tcactcatcc tgcagagacc 32280
agctcaacta ttcccagaac aatccccaat ttttctcatc atgaatcaga tgccacacct 32340
tcaatagcca ccagtcctgg ggcagaaacc agttcagcta ttccaattat gactgtctca 32400
cctggtgcag aagatctggt gacctcacag gtcactagtt ctgggacaga cagaaatatg 32460
acrattccHa ctttgactct ttctcctggt gaaccaaaga cgatagcctc attagtcacc 32520
caLCCugaag cacagacaag ttcggccatt ccaacttcaa ctatctcgcc tgctgtatca 32580
cggttggtga cctcaatggt caccagtttg gcggcaaaga caagtacaac taatcgagct 32640
ctgacaaact cccctggtga accagctaca acagtttcat tggtcacgca tcctgcacag 32700
accagcccaa cagttccctg gacaacttcc atttttttcc atagtaaatc agacaccaca 32760
ccttcaatga ccaccagtca tggggcagaa tccagttcag ctgttccaac tccaactgtt 32820
tcaactgagg taccaggagt agtgacccct ttggtcacca gttctagggc agtgatcagt 32880
acaactattc caattctgac tctttctcct ggtgaaccag agaccacacc
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accagtcatg gggaagaagc cagttctgct attccaactc caactgtttc acctggggta 
33000
ccaggagtgg tgacctctct ggtcactagt tctagggcag tgactagtac aactattcca 33060
attctgactt tttctcttgg tgaaccagag accacacctt caatggccac cagtcatggg 33120
acagaagctg gctcagctgt tccaactgtt ttacctgagg taccaggaat ggtgacctct 33180
ctggttgcta gttctagggc agtaaccagt acaactcttc caactctgac tctttctcct 33240
ggtgaaccag agaccacacc ttcaatggcc accagtcatg gggcagaagc cagctcaact 33300
gttccaactg tttcacctga ggtaccagga gtggtgacct ctctggtcac tagttctagt 33360
ggagtaaaca gtacaagtat tccaactctg attctttctc ctggtgaact agaaaccaca 33420
ccttcaatgg ccaccagtca tggggcagaa gccagctcag ctgttccaac tccaactgtt 33480
tcacctgggg tatcaggagt ggtgacccct ctggtcacta gttccagggc agtgaccagt 33540
acaactattc caattctaac tctttcttct agtgagccag agaccacacc ttcaatggcc 33600
accagtcatg gggtagaagc cagctcagct gttctaactg tttcacctga ggtaccagga 33660
atggtgacct ctctggtcac tagttctaga gcagtaacca gtacaactat tccaactctg 33720
actatttctt ctgatgaacc agagaccaca acttcattgg tcacccattc tgaggcaaag 33780
atgatttcag ccattccaac tttagctgtc tcccctactg tacaagggct ggtgacttca 33840
ctggtcacta gttctgggtc agagaccagt gcgttttcaa atctaactgt tgcctcaagt 33900
caaccagaga ccatagactc atgggtcgct catcctggga cagaagcaag ttctgttgtt 33960
ccaactttga ctgtctccac tggtgagccg tttacaaata tctcattggt cacccatcct 34020
gcagagagta gctcaactct tcccaggaca acctcaaggt tttcccacag tgaattagac 34080
actatgcctt ctacagtcac cagtcctgag gcagaatcca gctcagccat ttcaactact 34140
atttcacctg gtataccagg tgtgctgaca tcactggtca ctagctctgg gagagacatc 34200
agtgcaactt ttccaacagt gcctgagtcc ccacatgaat cagaggcaac agcctcatgg 34260
gttactcatc ctgcagtcac cagcacaaca gttcccagga caacccctaa ttattctcat 34320
agtgaaccag acaccacacc atcaatagcc accagtcctg gggcagaagc cacttcagat 34380
tttccaacaa taactgtctc acctgatgta ccagatatgg taacctcaca ggtcactagt 34440
tctgggacag acaccagtat aactattcca actctgactc tttcttctgg tgagccagag 34500
accacaacct catttatcac ctattctgag acacacacaa gttcagccat tccaactctc 34560
cctgtctccc ctggtgcatc aaagatgctg acctcactgg tcatcagttc tgggacagac 34620
agcactacaa ctttcccaac actgacggag accccatatg aaccagagac aacagccata 34680
cagctcattc atcctgcaga gaccaacaca atggttccca agacaactcc caagttttcc 34740
catagtaagt cagacaccac actcccagta gccatcacca gtcctgggcc agaagccagt 34800
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tcagctgttt 


caacgacaac 


tatctcacct 


gatatgtcag 


atctggtgac 


ctcactggtc 


34860 


cctagttctg ggacagacac cagtacaacc ttcccaacat tgagtgagac cccatatgaa 34920
ccagagacta cagtcacgtg gctcactcat cctgcagaaa ccagcacaac ggtttctggg 34980
acaattccca acttttccca taggggatca gacactgcac cctcaatggt caccagtcct 35040
ggagtagaca cgaggtcagg tgttccaact acaaccatcc cacccagtat accaggggta 35100
gtgacctcac aggtcactag ttctgcaaca gacactagta cagctattcc aactttgact 35160
ccttctcctg gtgaaccaga gaccacagcc tcatcagcta cccatcctgg gacacagact 35220
ggcttcactg ttccaattcg gactgttccc tctagtgagc cagatacaat ggcttcctgg 35280
gtcactcatc ctccacagac cagcacacct gtttccagaa caacctccag tttttcccat 35340
agtagtccag atgccacacc tgtaatggcc accagtccta ggacagaagc cagttcagct 35400
gtactgacaa caatctcacc tggtgcacca gagatggtga cttcacagat cactagttct 35460
ggggcagcaa ccagtacaac tgttccaact ttgactcatt ctcctggtat gccagagacc 35520
acagccttat tgagcaccca tcccagaaca gggacaagta aaacatttcc tgcttcaact 35580
gtgtttcctc aagtatcaga gaccacagcc tcactcacca ttagacctgg tgcagagact 35640
agcacagctc tcccaactca gacaacatcc tctctcttca ccctacttgt aactggaacc 35700
agcagagttg atctaagtcc aactgcttca cctggtgttt ctgcaaaaac agccccactt 35760
tccacccatc cagggacaga gaccagcaca atgattccaa cttcaactct ttcccttggt 35820
ttactagaga ctacaggctt actggccacc agctcttcag cagagaccag cacgagtact 35880
ctaactctga ctgtttcccc tgctgtctct gggctttcca gtgcctctat aacaactgat 35940
aagccccaaa ctgtgacctc ctggaacaca gaaacctcac catctgtaac ttcagttgga 36000
cccccagaat tttccaggac tgtcacaggc accactatga ccttgatacc atcagagatg 36060
ccaacaccac ctaaaaccag tcatggagaa ggagtgagtc caaccactat cttgagaact 36120
acaatggttg aagccactaa tttagctacc acaggttcca gtcccactgt ggccaagaca 36180
acaaccacct tcaatacact ggctggaagc ctctttactc ctctgaccac acctgggatg 36240
tccaccttgg cctctgagag tgtgacctca agaacaagtt ataaccatcg gtcctggatc 36300
tccaccacca gcagttataa ccgtcggtac tggacccctg ccaccagcac tccagtgact 36360
tctacattct ccccagggat ttccacatcc tccatcccca gctccacagc agccacagtc 36420
ccattcatgg tgccattcac cctcaacttc accatcacca acctgcagta cgaggaggac 36480
atgcggcacc ctggttccag gaagttcaac gccacagaga gagaactgca gggtctgctc 36540
aaacccttgt tcaggaatag cagtctggaa tacctctatt caggctgcag actagcctca 36600
ctcaggccag agaaggatag ctcagccatg gcagtggatg ccatctgcac acatcgccct 36660
gaccctgaag acctcggact ggacagagag cgactgtact gggagctgag caatctgaca 36720
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aatggcatcc aggagctggg cccctacacc ctggaccgga acagtctcta tgtcaatggt 36780 

ttcacccatc gaagctctat gcccaccacc agcactcctg ggacctccac agtggatgtg 36840 

ggaacctcag ggactccatc ctccagcccc agccccacgg ctgctggccc tctcctgatg 36900 

ccgttcaccc tcaacttcac catcaccaac ctgcagtacg aggaggacat gcgtcgcact 36960 

ggctccagga agttcaacac catggagagt gtcctgcagg gtctgctcaa gcccttgttc 37020 

aagaacacca gtgttggccc tctgtactct ggctgcagat tgaccttgct caggcccgag 37080 

aaagatgggg cagccactgg agtggatgcc atctgcaccc accgccttga ccccaaaagc 37140 

cctggactca acagggagca gctgtactgg gagctaagca aactgaccaa tgacattgaa 37200 

gagctgggcc cctacaccct ggacaggaac agtctctatg tcaatggttt cacccatcag 37260 

agctctgtgt ccaccaccag cactcctggg acctccacag tggatctcag aacctcaggg 37320 

actccatcct ccctctccag ccccacaatt atggctgctg gccctctcct ggtaccattc 37380 

accctcaact tcaccatcac caacctgcag tatggggagg acatgggtca ccctggctcc 37440 

aggaagttca acaccacaga gagggtcctg cagggtctgc ttggtcccat attcaagaac 37500 

accagtgttg gccctctgta ctctggctgc agactgacct ctctcaggtc tgagaaggat 37560 

ggagcagcca ctggagtgga tgccatctgc atccatcatc ttgaccccaa aagccctgga 37620 

ctcaacagag agcggctgta ctgggagctg agccaactga ccaatggcat caaagagctg 37680 

ggcccctaca ccctggacag gaacagtctc tatgtcaatg gtttcaccca tcggacctct 37740 

gtgcccacca ccagcactcc tgggacctcc acagtggacc ttggaacctc agggactcca 37800 

ttctccctcc caagccccgc aactgctggc cctctcctgg tgctgttcac cctcaacttc 37860 

accatcacca acctgaagta tgaggaggac atgcatcgcc ctggctccag gaagttcaac 37920 

accactgaga gggtcctgca gactctgctt ggtcctatgt tcaagaacac cagtgttggc 37980 

cttctgtact ctggctgcag actgaccttg ctcaggtccg agaaggatgg agcagccact 38040 

ggagtggatg ccatctgcac ccaccgtctt gaccccaaaa gccctggact ggacagagag 38100 

cagctatact gggagctgag ccagctgacc aatggcatca aagagctggg cccctacacc 38160 

ctggacagga acagtctcta tgtcaatggt ttcacccatt ggatccctgt gcccaccagc 38220 

agcactcctg ggacctccac agtggacctt gggtcaggga ctccatcctc cctccccagc 38280 

cccacagctg ctggccctct cctggtgcca ttcaccctca acttcaccat caccaacctg 38340 

cagtacgagg aggacatgca tcacccaggc tccaggaagt tcaacaccac ggagcgggtc 38400 

ctgcagggtc tgcttggtcc catgttcaag aacaccagtg tcggccttct gtactctggc 38460 

tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc 38520 

tgcacccacc gtcttgaccc caaaagccct ggagtggaca gggagcagct atactgggag 38580 
ctgagccagc tgaccaatgg catcaaagag ctgggtccct acaccctgga cagaaacagt 38640

ctctatgtca atggtttcac ccatcagacc tctgcgccca acaccagcac tcctgggacc 38700 

tccacagtgg accttgggac ctcagggact ccatcctccc tccccagccc tacatcngct 38760 

ggccctctcc tggtnccntt caccctcaac ttcaccatca ccaacctgca gtacgaggag 38820 

gacatgcggc acccnggntc caggaagttc aacaccacng agagggtnct gcagggtctg 38880 

ctnaagcccc tnttcaagag caccagtgtt ggccctctgt actctggctg cagactgacc 38940 

ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 39000 

cttgacccca aaagccctgg agtggacagg gagcagctat actgggagct gagccagctg 39060 

accaatggca tcaaagagct gggtccctac accctggaca gaaacagtct ctatgtcaat 39120 

ggtttcaccc atcagacctc tgcgcccaac accagcactc ctgggacctc cacagtggac 39180 

cttgggacct cagggactcc atcctccctc cccagcccta catctgctgg ccctctcctg 39240 

gtgccattca ccctcaactt caccatcacc aacctgcagt acgaggagga catgcatcac 39300 

ccaggctcca ggaagttcaa caccacggag cgggtcctgc agggtctgct tggtcccatg 39360 

ttcaagaaca ccagtgtcgg ccttctgtac tctggctgca gactgacctt gctcaggcct 39420 

gagaagaatg gggcagccac tggaatggat gccatctgca gccaccgtct tgaccccaaa 39480 

agccctggac tcaacagaga gcagctgtac tgggagctga gccagctgac ccatggcatc 39540 

aaagagctgg gcccctacac cctggacagg aacagtctct atgtcaatgg tttcacccat 39600 

cggagctctg tggcccccac cagcactcct gggacctcca cagtggacct tgggacctca 39660 

gggactccat cctccctccc cagccccaca acagctgttc ctctcctggt gccgttcacc 39720 

ctcaacttta ccatcaccaa tctgcagtat ggggaggaca tgcgtcaccc tggctccagg 39780 

aagttcaaca ccacagagag ggtcctgcag ggtctgcttg gtcccttgtt caagaactcc 39840 

agtgtcggcc ctctgtactc tggctgcaga ctgatctctc tcaggtctga gaaggatggg 39900 

gcagccactg gagtggatgc catctgcacc caccacctta accctcaaag ccctggactg 39960 

gacagggagc agctgtactg gcagctgagc cagatgacca atggcatcaa agagctgggc 40020 

ccctacaccc tggaccggaa cagtctctac gtcaatggtt tcacccatcg gagctctggg 40080 

ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 40140 

cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct caacttcacc 40200 

atcaccaacc tgcagtatga ggaggacatg catcgccctg gatctaggaa gttcaacacc 40260 

acagagaggg tcctgcaggg tctgcttagt cccattttca agaactccag tgttggccct 40320 

ctgtactctg gctgcagact gacctctctc aggcccgaga aggatggggc agcaactgga 40380 

atggatgctg tctgcctcta ccaccctaat cccaaaagac ctggactgga cagagagcag 40440 

ctgtactggg agctaagcca gctgacccac aacatcactg agctgggccc ctacagcctg 40500 
gacagggaca gtctctatgt caatggtttc acccatcaga actctgtgcc caccaccagt 40560

actcctggga cctccacagt gtactgggca accactggga ctccatcctc cttccccggc 40620 

cacacagagc ctggccctct cctgatacca ttcactttca actttaccat caccaacctg 40680 

cattatgagg aaaacatgca acaccctggt tccaggaagt tcaacaccac ggagagggtt 40740 

ctgcagggtc tgctcaagcc cttgttcaag aacaccagtg ttggccctct gtactctggc 40800 

tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 40860 

tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgggag 40920 

ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 40980 

ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 41040 

tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 41100 

ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 41160 

aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 41220 

ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc 41280 

ttgctcagac ctgagaagca tgaggcagcc actggagtgg acaccatctg tacccaccgc 41340 

gttgatccca tcggacctgg actggacagg gagcggctat actgggagct gagccagctg 41400 

accaacagca ttaccgaact gggaccctac accctggaca gggacagtct ctatgtcaat 41460 

ggcttcaacc ctcggagctc tgtgccaacc accagcactc ctgggacctc cacagtgcac 41520 

ctggcaacct ctgggactcc atcctccctg cctggccaca cagcccctgt ccctctcttg 41580 

ataccattca ccctcaactt taccatcacc aacctgcatt atgaggaaaa catgcaacac 41640 

cctggttcca ggaagttcaa caccacggag agggttctgc agggtctgct caagcccttg 41700 

ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 41760 

gagaagcatg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 41820 

ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 41880 

nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 41940 

cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 42000 

gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 42060

ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 42120 

aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 42180 

agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 42240 

gcagccactg gaatggatgc catctgcagc caccgtcttg accccaaaag ccctggactc 42300 

gacagagagc agctgtactg ggagctgagc cagctgaccc atggcatcaa agagctgggc 42360 
cagtctctat gtcaatggtt tcacccatcg gagctctgtg 42420

gcccccacca gcactcctgg gacctccaca gtggaccttg ggacctcagg gactccatcc 42480

tccctcccca gccccacaac agctgttcct ctcctggtgc cgttcaccct caactttacc 42540

atcaccaatc tgcagtatgg ggaggacatg cgtcaccctg gctccaggaa gttcaacacc 42600

acagagaggg tcctgcaggg tctgcttggt cccttgttca agaactccag tgtcggccct 42660

ctgtactctg gctgcagact gatctctctc aggtctgaga aggatggggc agccactgga 42720

gtggatgcca tctgcaccca ccaccttaac cctcaaagcc ctggactgga cagggagcag 42780

ctgtactggc agctgagcca gatgaccaat ggcatcaaag agctgggccc ctacaccctg 42840

gaccggaaca gtctctacgt caatggtttc acccatcgga gctctgggct caccaccagc 42900

actccttgga cttccacagt tgaccttgga acctcaggga ctccatcccc cgtccccagc 42960

cccacaactg ctggccctct cctggtgcca ttcaccctaa acttcaccat caccaacctg 43020

cagtatgagg aggacatgca tcgccctgga tctaggaagt tcaacgccac agagagggtc 43080

ctgcagggtc tgcttagtcc catattcaag aactccagtg ttggccctct gtactctggc 43140

tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 43200

tgcctctacc accctaatcc caaaagacct ggactggaca gagagcagct gtactgggag 43260

ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 43320

ctctatgtca atggtttcac ccatcagagc tctatgacga ccaccagaac tcctgatacc 43380

tccacaatgc acctggcaac ctcgagaact ccagcctccc tgtctggacc tacgaccgcc 43440

agccctctcc tggtgctatt cacaatcaac tgcaccatca ccaacctgca gtacgaggag 43500

gacatgcgtc gcactggctc caggaagttc aacaccatgg agagtgtcct gcagggtctg 43560

ctcaagccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagattgacc 43620

ttgctcaggc ccaagaaaga tggggcagcc actggagtgg atgccatctg cacccaccgc 43680

cttgacccca aaagccctgg actcaacagg gagcagctgt actgggagct aagcaaactg 43740

accaatgaca ttgaagagct gggcccctac accctggaca ggaacagtct ctatgtcaat 43800

ggtttcaccc atcagagctc tgtgtccacc accagcactc ctgggacctc cacagtggat 43860

ctcagaacct cagggactcc atcctccctc tccagcccca caattatgnc nnctgnccct 43920

ctcctgntnc cnttcaccnt caacttnacc atcaccaacc tgcantangn ggannacatg 43980

cnncncccna gntccaggaa gttcaacacc acngagaggg tcctacaggg tctgctcagg 44040

cccttgttca agaacaccag tgtcagctct ctgtactctg gttgcagact gaccttgctc 44100

aggcctgaga aggatggggc agccaccaga gtggatgctg cctgcaccta ccgccctgat 44160

cccaaaagcc ctggactgga cagagagcaa ctatactggg agctgagcca gctaacccac 44220

agcatcactg agctgggacc ctacaccctg gacagggtca gtctctatgt

aaccctcgga gctctgtgcc aaccaccagc actcctggga cctccacagt gcacctggca 44340

acctctggga ctccatcctc cctgcctggc cacacancnn ctgnccctct cctgntnccn 44400 

ttcaccntca acttnaccat caccaacctg cantangngg annacatgcn ncncccnggn 44460 

tccaggaagt tcaacaccac ngagagggtt ctgcagggtc tgctcaaacc cttgttcagg 44520 

aatagcagtc tggaatacct ctattcaggc tgcagactag cctcactcag gccagagaag 44580 

gatagctcag ccatggcagt ggatgccatc tgcacacatc gccctgaccc tgaagacctc 44640 

ggactggaca gagagcgact gtactgggag ctgagcaatc tgacaaatgg catccaggag 44700 

ctgggcccct acaccctgga ccggaacagt ctctacgtca atggtttcac ccatcggagc 44760 

tctgggctca ccaccagcac tccttggact tccacagttg accttggaac ctcagggact 44820 

ccatcccccg tccccagccc cacaactgct ggccctctcc tggtgccatt caccctcaac 44880 

ttcaccatca ccaacctgca gtatgaggag gacatgcatc gccctggttc caggaggttc 44940 

aacaccacgg agagggttct gcagggtctg ctcacgccct tgttcaagaa caccagtgtt 45000 

ggccctctgt actctggctg cagactgacc ttgctcagac ctgagaagca agaggcagcc 45060 

actggagtgg acaccatctg tacccaccgc gttgatccca tcggacctgg actggacaga 45120 

gagcggctat actgggagct gagccagctg accaacagca tcacagagct gggaccctac 45180 

accctggata gggacagtct ctatgtcaat ggcttcaacc cttggagctc tgtgccaacc 45240 

accagcactc ctgggacctc cacagtgcac ctggcaacct ctgggactcc atcctccctg 45300 

cctggccaca cagcccctgt ccctctcttg ataccattca ccctcaactt taccatcacc 45360 

gacctgcatt atgaagaaaa catgcaacac cctggttcca ggaagttcaa caccacggag 45420 

agggttctgc agggtctgct caagcccttg ttcaagagca ccagcgttgg ccctctgtac 45480

tctggctgca gactgacctt gctcagacct gagaaacatg gggcagccac tggagtggac 45540 

gccatctgca ccctccgcct tgatcccact ggtcctggac tggacagaga gcggctatac 45600 

tgggagctga gccagctgac caacagcgtt acagagctgg gcccctacac cctggacagg 45660 

gacagtctct atgtcaatgg cttcacccat cggagctctg tgccaaccac cagtattcct 45720 

gggacctctg cagtgcacct ggaaacctct gggactccag cctccctccc tggccacaca 45780 

gcccctggcc ctctcctggt gccattcacc ctcaacttca ctatcaccaa cctgcagtat 45840 

gaggaggaca tgcgtcaccc tggttccagg aagttcagca ccacggagag agtcctgcag 45900 

ggtctgctca agcccttgtt caagaacacc agtgtcagct ctctgtactc tggttgcaga 45960 

ctgaccttgc tcaggcctga gaaggatggg gcagccacca gagtggatgc tgtctgcacc 46020 
catcgtcctg accccaaaag ccctggactg gacagagagc ggctgtactg gaagctgagc 46080 
cagctgaccc acggcatcac tgagctgggc ccctacaccc tggacaggca cagtctctat 46140 
gtcaatggtt tcacccatca gagctctatg acgaccacca gaactcctga tacctccaca 46200

atgcacctgg caacctcgag aactccagcc tccctgtctg gacctacgac cgccagccct 46260

ctcctggtgc tattcacaat taacttcacc atcactaacc tgcggtatga ggagaacatg 46320

catcaccctg gctctagaaa gtttaacacc acggagagag tccttcaggg tctgctcagg 46380

cctgtgttca agaacaccag tgttggccct ctgtactctg gctgcagact gaccacgctc 46440

aggcccaaga aggatggggc agccaccaaa gtggatgcca tctgcaccta ccgccctgat 46500

cccaaaagcc ctggactgga cagagagcag ctatactggg agctgagcca gctaacccac 46560

agcatcactg agctgggccc ctacacccag gacaQQQaca gtctctatgt caatQQcttc 46620

acccatcgga gctctgtgcc aaccaccagt attcctggga cctctgcagt Qcacctaaaa 46680

acctctggga ctccagcctc cctccctggc cacacagccc ctggccctct cctggtgcca 46740

ttcaccctca acttcactat caccaacctg cagtatgagg aggacatgcg tcaccctggt 46800

tccaggaagt tcaacaccac ggagagagtc ctgcagggtc tgctcaagcc cttattcaaa 46860

agcaccagtg ttggccctct gtactctggc tgcagactga ccttactcaa Qcctaaaaaa 46920

cgtggggcag ccaccggcgt ggacaccatc tgcactcacc accttaaccc tctaaaccca 46980

ggactggaca gagagcagct atactgggag ctgagcaaac tQaCCCQtQQ catcatcaaa 47040

ctgggcccct acctcctgga cagaggcagt ctctatgtca atggtttcac ccatcggacc 47100

tctgtgccca ccaccagcac tcctgggacc tccacagtgg accttggaac ctcaaaaact 47160

ccattctccc tcccaagccc cgcancnnct gnccctctcc tgntnccntt caccntcaac 47220

ttnaccatca ccaacctgca ntangnggan nacatgcnnc ncccnggntc caggaagttc 47280

accaccacng agagggtcct gcagactctg cttggtccta tgttcaagaa caccaatatt 47340

ggccttctgt actctggctg cagactgacc ttgctcaggt ccoaoaacoa 47400

actggagtgg atgccatctg cacccaccgt cttgacccca aaagccctgg aataqacaaq 47460

gagcaactat actgggagct gagccagctg accaatggca ttaaagaact gggcccctac 47520

accctggaca ggaacagtct ctatgtcaat gggttcaccc attggatccc tgtgcccacc 47580

agcagcactc ctgggacctc cacagtggac cttgggtcag ggactccatc ctccctcccc 47640

agccccacaa ctgctggccc tctcctggtg ccgttcaccc tcaacttcac catcaccaac 47700

ctgaagtacg aggaggacat gcattgccct ggctccagga agttcaacac cacagagaga 47760

gtcctgcaga gtctgcttgg tcccatgttc aagaacacca gtgttggccc tctgtactct 47820

ggctgcagac tgaccttgct caggtccgag aaggatggag cagccactgg agtggatgcc 47880

atctgcaccc accgtcttga ccccaaaagc cctggagtgg acagggagca gctatactgg 47940

gagctgagcc agctgaccaa tggcatcaaa gagctgggtc cctacaccct ggacagaaac 48000

agtctctatg tcaatggttt cacccatcag acctctgcgc ccaacaccag

acctccacag tggaccttgg gacctcaggg actccatcct ccctccccag ccctacancn 48120

nctgnccctc tcctgntncc nttcaccntc aacttnacca tcaccaacct gcantangng 48180 

gannacatgc nncncccngg ntccaggaag ttcaacacca cngagngngt nctgcagggt 48240 

ctgctnnnnc ccntnttcaa gaacnccagt gtnggccntc tgtactctgg ctgcagactg 48300 

acctnnctca ggncngagaa gnatggngca gccactggan tggatgccat ctgcanccac 48360 

cnncntnanc ccaaaagncc tggactgnac agngagcngc tntactggga gctnagccan 48420 

ctgaccaann ncatcnnnga gctgggnccc tacaccctgg acaggnacag tctctatgtc 48480 

aatggtttca cccattggat ccctgtgccc accagcagca ctcctgggac ctccacagtg 48540 

gaccttgggt cagggactcc atcctccctc cccagcccca caactgctgg ccctctcctg 48600 

gtgccgttca ccctcaactt caccatcacc aacctgaagt acgaggagga catgcattgc 48660 

cctggctcca ggaagttcaa caccacagag agagtcctgc agagtctgct tggtcccatg 48720 

ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctc gctcaggtcc 48780 

gagaaggatg gagcagccac tggagtggat gccatctgca cccaccgtgt tgaccccaaa 48840 

agccctggag tggacaggga gcagctatac tgggagctga gccagctgac caatggcatc 48900 

aaagagctgg gtccctacac cctggacaga aacagtctct atgtcaatgg tttcacccat 48960 

cagacctctg cgcccaacac cagcactcct gggacctcca cagtgnacnt nggnacctcn 49020 

gggactccat cctccntccc cngccncaca tctgctggcc ctctcctggt gccattcacc 49080 

ctcaacttca ccatcaccaa cctgcagtac gaggaggaca tgcatcaccc aggctccagg 49140 

aagttcaaca ccacggagcg ggtcctgcag ggtctgcttg gtcccatgtt caagaacacc 49200 

agtgtcggcc ttctgtactc tggctgcaga ctgaccttgc tcaggcctga gaagaatggg 49260 

gcaaccactg gaatggatgc catctgcacc caccgtcttg accccaaaag ccctggactg 49320 

nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 49380 

ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 49440 

cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc 49500 

tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 49560 

atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc 49620 

acngagaggg ttctgcaggg tctgctcaaa cccttgttca ggaatagcag tctggaatac 49680 

ctctattcag gctgcagact agcctcactc aggccagaga aggatagctc agccatggca 49740 

gtggatgcca tctgcacaca tcgccctgac cctgaagacc tcggactgga cagagagcga 49800 

ctgtactggg agctgagcaa tctgacaaat ggcatccagg agctgggccc ctacaccctg 49860 

gaccggaaca gtctctatgt caatggtttc acccatcgaa gctctatgcc caccaccagc 49920 
actcctggga cctccacagt ggatgtggga acctcaggga ctccatcctc cagccccagc 49980

cccacgactg ctggccctct cctgatacca ttcaccctca acttcaccat caccaacctg 50040

cagtatgggg aggacatggg tcaccctggc tccaggaagt tcaacaccac agagagggtc 50100

ctgcagggtc tgcttggtcc catattcaag aacaccagtg ttggccctct gtactctggc 50160

tgcagactga cctctctcag gtctgagaag gatggagcag ccactggagt ggatgccatc 50220

tgcatccatc atcttgaccc caaaagccct ggactcaaca gagagcggct gtactgggag 50280

ctgagccaac tgaccaatgg catcaaagag ctgggcccct acaccctgga caggaacagt 50340

ctctatgtca atggtttcac ccatcggacc tctgtgccca ccaccagcac tcctgggacc 50400

tccacagtgg accttggaac ctcagggact ccattctccc tcccaagccc cgcaactgct 50460

ggccctctcc tggtgctgtt caccctcaac ttcaccatca ccaacctgaa gtatgaggag 50520

gacatgcatc gccctggctc caggaagttc aacaccactg agagggtcct gcagactctg 50580

cttggtccta tgttcaagaa caccagtgtt ggccttctgt actctggctg cagactgacc 50640

ttgctcaggt ccgagaagga tggagcagcc actggagtgg atgccatctg cacccaccgt 50700

cttgacccca aaagccctgg actgnacagn gagcngctnt actgggagct nagccanctg 50760

accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 50820

ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 50880

ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg 50940

ntnccnttca ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc 51000

ccnggntcca ggaagttcaa caccacngag agagtccttc agggtctgct caggcctgtg 51060

ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcaggccc 51120

aagaaggatg gggcagccac caaagtggat gccatctgca cctaccgccc tgatcccaaa 51180

agccctggac tggacagaga gcagctatac tgggagctga gccagctaac ccacagcatc 51240

actgagctgg gcccctacac ccaggacagg gacagtctct atgtcaatgg cttcacccat 51300

cggagctctg tgccaaccac cagtattcct gggacctctg cagtgcacct ggaaaccact 51360

gggactccat cctccttccc cggccacaca gagcctggcc ctctcctgat accattcact 51420

ttcaacttta ccatcaccaa cctgcgttat gaggaaaaca tgcaacaccc tggttccagg 51480

aagttcaaca ccacggagag ggttctgcag ggtctgctca cgcccttgtt caagaacacc 51540

agtgttggcc ctctgtactc tggctgcaga ctgaccttgc tcagacctga gaagcaggag 51600

gcagccactg gagtggacac catctgtacc caccgcgttg atcccatcgg acctggactg 51660

gacagagagc ggctatactg ggagctgagc cagctgacca acagcatcac agagctggga 51720

ccctacaccc tggataggga cagtctctat gtcgatggct tcaacccttg gagctctgtg 51780

ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 51840

cccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 51900

atcaccgacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc 51960 

acggagaggg ttctgcaggg tctgctcaag cccttgttca agagcaccag cgttggccct 52020 

ctgtactctg gctgcagact gaccttgctc agacctgaga aacatggggc agccactgga 52080 

gtggacgcca tctgcaccct ccgccttgat cccactggtc ctggactgga cagagagcgg 52140 

ctatactggg agctgagcca gctgaccaac agcatcacag agctgggacc ctacaccctg 52200 

gatagggaca gtctctatgt caatggcttc aacccttgga gctctgtgcc aaccaccagc 52260 

actcctggga cctccacagt gcacctggca acctctggga ctccatcctc cctgcctggc 52320 

cacacaactg ctggccctct cctggtgccg ttcaccctca acttcaccat caccaacctg 52380 

aagtacgagg aggacatgca ttgccctggc tccaggaagt tcaacaccac agagagagtc 52440 

ctgcagagtc tgcatggtcc catgttcaag aacaccagtg ttggccctct gtactctggc 52500 

tgcagactga ccttgctcag gtccgagaag gatggagcag ccactggagt ggatgccatc 52560 

tgcacccacc gtcttgaccc caaaagccct ggactgnaca gngagcngct ntactgggag 52620 

ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 52680 

ctctatgtca atggtttcac ccatcnganc tctgngccca ccaccagcac tcctgggacc 52740 

tccacagtgn acntnggnac ctcngggact ccatcctccn tccccngccn cacancnnct 52800 

gnccctctcc tgntnccntt caccntcaac ttnaccatca ccaacctgca ntangnggan 52860 

nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg 52920 

ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc 52980 

tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 53040 

cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg 53100 

accaacagca tcacagagct gggaccctac accctggata gggacagtct ctatgtcaat 53160 

ggtttcaccc atcgaagctc tatgcccacc accagtattc ctgggacctc tgcagtgcac 53220 

ctggaaacct ctgggactcc agcctccctc cctggccaca cagcccctgg ccctctcctg 53280 

gtgccattca ccctcaactt cactatcacc aacctgcagt atgaggagga catgcgtcac 53340 

cctggttcca ggaagttcaa caccacggag agagtcctgc agggtctgct caagcccttg 53400 

ttcaagagca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcaggcct 53460 

gaaaaacgtg gggcagccac cggcgtggac accatctgca ctcaccgcct tgaccctcta 53520 

aaccctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 53580 

nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 53640 

cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 53700 
gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc 53760

ntcaacttna ccatcaccaa cctgcantan gnggannaca tgcnncnccc nggntccagg 53820 

aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc 53880 

agtgtnggcc ntctgtactc tggctgcaga ctgacctnnc tcaggncnga gaagnatggn 53940 

gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 54000 

nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 54060 

ccctacaccc tggacaggna cagtctctat gtcaatggtt ttcaccctcg gagctctgtg 54120 

ccaaccacca gcactcctgg gacctccaca gtgcacctgg caacctctgg gactccatcc 54180 

tccctgcctg gccacacagc ccctgtccct ctcttgatac cattcaccct caactttacc 54240 

atcaccaacc tgcattatga agaaaacatg caacaccctg gttccaggaa gttcaacacc 54300 

acggagcggg tcctgcaggg tctgcttggt cccatgttca agaacacaag tgtcggcctt 54360 

ctgtactctg gctgcagact gaccttgctc aggcctgaga agaatggggc agccactgga 54420 

atggatgcca tctgcagcca ccgtcttgac cccaaaagcc ctggactgna cagngagcng 54480 

ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 54540 

gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc 54600 

actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 54660 

cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat caccaacctg 54720 

cantangngg annacatgcn ncncccnggn tccaggaagt tcaacaccac ngagngngtn 54780 

ctgcagggtc tgctnnnncc cntnttcaag aacnccagtg tnggccntct gtactctggc 54840 

tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgccatc 54900 

tgcanccacc nncntnancc caaaagncct ggactgnaca gngagcngct ntactgggag 54960 

ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 55020 

ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 55080 

tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacagagcct 55140 

ggccctctcc tgataccatt cactttcaac tttaccatca ccaacctgca ttatgaggaa 55200 

aacatgcaac accctggttc caggaagttc aacaccacgg agagggttct gcagggtctg 55260 

ctcacgccct tgttcaagaa caccagtgtt ggccctctgt actctggctg cagactgacc 55320 

ttgctcagac ctgagaagca ggaggcagcc actggagtgg acaccatctg tacccaccgc 55380 

gttgatccca tcggacctgg actgnacagn gagcngctnt actgggagct nagccanctg 55440 

accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 55500 

ggtttcaccc atcnganctc tgngcccacc accagcactc ctgggacctc cacagtgnac 55560 

ntnggnacct cngggactcc atcctccntc cccngccnca cancnnctgn ccctctcctg 55620 
ntnccnttca ccntcaactt naccatcacc aacctgcant angngganna catgcnncnc 55680

ccnggntcca ggaagttcaa caccacngag ngngtnctgc agggtctgct nnnncccntn 55740 

ttcaagaacn ccagtgtngg ccntctgtac tctggctgca gactgacctn nctcaggncn 55800 

gagaagnatg gngcagccac tggantggat gccatctgca nccaccnncn tnancccaaa 55860 

agncctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 55920 

nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 55980 

cggagctctg tgccaaccac cagcagtcct gggacctcca cagtgcacct ggcaacctct 56040 

gggactccat cctccctgcc tggccacaca gcccctgtcc ctctcttgat accattcacc 56100 

ctcaacttta ccatcaccaa cctgcattat gaagaaaaca tgcaacaccc tggttccagg 56160 

aagttcaaca ccacggagag ggttctgcag ggtctgctca agcccttgtt caagagcacc 56220 

agtgttggcc ctctgtactc tggctgcaga ctgaccttgc tcagacctga gaaacatggg 56280 

gcagccactg gagtggacgc catctgcacc ctccgccttg atcccactgg tcctggactg 56340 

nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 56400 

ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcn ganctctgng 56460 

cccaccacca gcactcctgg gacctccaca gtgnacntng gnacctcngg gactccatcc 56520

tccntccccn gccncacanc nnctgnccct ctcctgntnc cnttcaccnt caacttnacc 56580 

atcaccaacc tgcantangn ggannacatg cnncncccng gntccaggaa gttcaacacc 56640 

acngagngng tnctgcaggg tctgctnnnn cccntnttca agaacnccag tgtnggccnt 56700 

ctgtactctg gctgcagact gacctnnctc aggncngaga agnatggngc agccactgga 56760 

ntggatgcca tctgcancca ccnncntnan cccaaaagnc ctggactgna cagngagcng 56820 

ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 56880

gacaggnaca gtctctatgt caatggtttc acccatcgga cctctgtgcc caccaccagc 56940 

actcctggga cctccacagt gcacctggca acctctggga ctccatcctc cctgcctggc 57000 

cacacagccc ctgtccctct cttgatacca ttcaccctca actttaccat caccaacctg 57060 

cagtatgagg aggacatgca tcgccctgga tctaggaagt tcaacaccac agagagggtc 57120 

ctgcagggtc tgcttagtcc cattttcaag aactccagtg ttggccctct gtactctggc 57180 

tgcagactga cctctctcag gcccgagaag gatggggcag caactggaat ggatgctgtc 57240 

tgcctctacc accctaatcc caaaagacct gggctggaca gagagcagct gtactgcgag 57300 

ctaagccagc tgacccacaa catcactgag ctgggcccct acagcctgga cagggacagt 57360 

ctctatgtca atggtttcac ccatcagaac tctgtgccca ccaccagtac tcctgggacc 57420 

tccacagtgt actgggcaac cactgggact ccatcctcct tccccggcca cacancnnct 57480 
caccntcaac ttnaccatca ccaacctgca ntangnggan 57540

nacatgcnnc ncccnggntc caggaagttc aacaccacng agngngtnct gcagggtctg 57600

ctnnnncccn tnttcaagaa cnccagtgtn ggccntctgt actctggctg cagactgacc 57660

tnnctcaggn cngagaagna tggngcagcc actggantgg atgccatctg canccaccnn 57720

cntnanccca aaagncctgg actgnacagn gagcngctnt actgggagct nagccanctg 57780

accaannnca tcnnngagct gggnccctac accctggaca ggnacagtct ctatgtcaat 57840

ggtttcaccc attggagctc tgggctcacc accagcactc cttggacttc cacagttgac 57900

cttggaacct cagggactcc atcccccgtc cccagcccca caactgctgg ccctctcctg 57960

gtgccattca ccctaaactt caccatcacc aacctgcagt atgaggagga catgcatcgc 58020

cctggatcta ggaagttcaa cgccacagag agggtcctgc agggtctgct tagtcccata 58080

ttcaagaaca ccagtgttgg ccctctgtac tctggctgca gactgacctt gctcagacct 58140

gagaagcagg aggcagccac tggagtggac accatctgta cccaccgcgt tgatcccatc 58200

ggacctggac tgnacagnga gcngctntac tgggagctna gccanctgac caannncatc 58260

nnngagctgg gnccctacac cctggacagg nacagtctct atgtcaatgg tttcacccat 58320

cnganctctg ngcccaccac cagcactcct gggacctcca cagtgnacnt nggnacctcn 58380

gggactccat cctccntccc cngccncaca ncnnctgncc ctctcctgnt nccnttcacc 58440

ntcaacttna ccatcaccaa cctgcantan gnggannaca tgcnncnccc nggntccagg 58500

aagttcaaca ccacngagng ngtnctgcag ggtctgctnn nncccntntt caagaacncc 58560

agtgtnggcc ntctgtactc tggctgcaga ctgacctnnc tcaggncnga gaagnatggn 58620

gcagccactg gantggatgc catctgcanc caccnncntn ancccaaaag ncctggactg 58680

nacagngagc ngctntactg ggagctnagc canctgacca annncatcnn ngagctgggn 58740

ccctacaccc tggacaggna cagtctctat gtcaatggtt tcacccatcg gagctttggg 58800

ctcaccacca gcactccttg gacttccaca gttgaccttg gaacctcagg gactccatcc 58860

cccgtcccca gccccacaac tgctggccct ctcctggtgc cattcaccct aaacttcacc 58920

atcaccaacc tgcagtatga ggaggacatg catcgccctg gctccaggaa gttcaacacc 58980

acggagaggg tccttcaggg tctgcttacg cccttgttca ggaacaccag tgtcagctct 59040

ctgtactctg gttgcagact gaccttgctc aggcctgaga aggatggggc agccaccaga 59100

"t" r* "t" n /" a r* r* ^ a tcgtcctgac cccaaaagcc ctggactgna cagngagcng

ctntactggg agctnagcca nctgaccaan nncatcnnng agctgggncc ctacaccctg 59220

gacaggnaca gtctctatgt caatggtttc acccatcnga nctctgngcc caccaccagc 59280

actcctggga cctccacagt gnacntnggn acctcnggga ctccatcctc cntccccngc 59340

cncacancnn ctgnccctct cctgntnccn ttcaccntca acttnaccat

cantangngg annacatgcn ncncccnggn tccaggaagt tcaacaccac ngagngngtn 59460

ctgcagggtc tgctnnnncc cntnttcaag aacnccagtg tnggccntct gtactctggc 59520 

tgcagactga cctnnctcag gncngagaag natggngcag ccactggant ggatgccatc 59580 

tgcanccacc nncntnancc caaaagncct ggactgnaca gngagcngct ntactgggag 59640 

ctnagccanc tgaccaannn catcnnngag ctgggnccct acaccctgga caggnacagt 59700 

ctctatgtca atggtttcac ccattggatc cctgtgccca ccagcagcac tcctgggacc 59760 

tccacagtgg accttgggtc agggactcca tcctccctcc ccagccccac aactgctggc 59820 

cctctcctgg taccattcac cctcaacttc accatcacca acctgcagta tggggaggac 59880 

atgggtcacc ctggctccag gaagttcaac accacagaga gggtcctgca gggtctgctt 59940 

ggtcccatat tcaagaacac cagtgttggc cctctgtact ctggctgcag actgacctct 60000 

ctcaggtccg agaaggatgg agcagccact ggagtggatg ccatctgcat ccatcatctt 60060 

gaccccaaaa gccctggact gnacagngag cngctntact gggagctnag ccanctgacc 60120 

aannncatcn nngagctggg nccctacacc ctggacaggn acagtctcta tgtcaatggt 60180 

ttcacccatc nganctctgn gcccaccacc agcactcctg ggacctccac agtgnacntn 60240 

ggnacctcng ggactccatc ctccntcccc ngccncacan cnnctgnccc tctcctgntn 60300 

ccnttcaccn tcaacttnac catcaccaac ctgcantang nggannacat gcnncncccn 60360 

ggntccagga agttcaacac cacngagngn gtnctgcagg gtctgctnnn ncccntnttc 60420 

aagaacncca gtgtnggccn tctgtactct ggctgcagac tgacctnnct caggncngag 60480 

aagnatggng cagccactgg antggatgcc atctgcancc accnncntna ncccaaaagn 60540 

cctggactgn acagngagcn gctntactgg gagctnagcc anctgaccaa nnncatcnnn 60600

gagctgggnc cctacaccct ggacaggnac agtctctatg tcaatggttt cacccatcag 60660 

acctttgcgc ccaacaccag cactcctggg acctccacag tggaccttgg gacctcaggg 60720 

actccatcct ccctccccag ccctacatct gctggccctc tcctggtgcc attcaccctc 60780 

aacttcacca tcaccaacct gcagtacgag gaggacatgc atcacccagg ctccaggaag 60840 

ttcaacacca cggagcgggt cctgcagggt ctgcttggtc ccatgttcaa gaacaccagt 60900 

gtcggccttc tgtactctgg ctgcagactg accttgctca ggcctgagaa gaatggggca 60960 

gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactgnac 61020 

agngagcngc tntactggga gctnagccan ctgaccaann ncatcnnnga gctgggnccc 61080 

tacaccctgg acaggnacag tctctatgtc aatggtttca cccatcngan ctctgngccc 61140 

accaccagca ctcctgggac ctccacagtg nacntnggna cctcngggac tccatcctcc 61200 

ntccccngcc ncacagcccc tgtccctctc ttgataccat tcaccctcaa ctttaccatc 61260 

accaacctgc attatgaaga aaacatgcaa caccctggtt ccaggaagtt caacaccacg 61320 

gagagggttc tgcagggtct gctcaagccc ttgttcaaga gcaccagcgt tggccctctg 61380 

tactctggct gcagactgac cttgctcaga cctgagaaac atggggcagc cactggagtg 61440 

gacgccatct gcaccctccg ccttgatccc actggtcctg gactggacag agagcggcta 61500 

tactgggagc tgagccagct gaccaacagc gttacagagc tgggccccta caccctggac 61560 

agggacagtc tctatgtcaa tggcttcacc cagcggagct ctgtgccaac caccagtatt 61620 

cctgggacct ctgcagtgca cctggaaacc tctgggactc cagcctccct ccctggccac 61680 

acagcccctg gccctctcct ggtgccattc accctcaact tcactatcac caacctgcag 61740 

tatgaggtgg acatgcgtca ccctggttcc aggaagttca acaccacgga gagagtcctg 61800 

cagggtctgc tcaagccctt gttcaagagc accagtgttg gccctctgta ctctggctgc 61860 

agactgacct tgctcaggcc tgaaaaacgt ggggcagcca ccggcgtgga caccatctgc 61920 

actcaccgcc ttgaccctct aaaccctgga ctggacagag agcagctata ctgggagctg 61980 

agcaaactga cccgtggcat catcgagctg ggcccctacc tcctggacag aggcagtctc 62040 

tatgtcaatg gtttcaccca tcggaacttt gtgcccatca ccagcactcc tgggacctcc 62100 

acagtacacc taggaacctc tgaaactcca tcctccctac ctagacccat agtgcctggc 62160 

cctctcctgg tgccattcac cctcaacttc accatcacca acttgcagta tgaggaggcc 62220 

atgcgacacc ctggctccag gaagttcaat accacggaga gggtcctaca gggtctgctc 62280 

aggcccttgt tcaagaatac cagtatcggc cctctgtact ccagctgcag actgaccttg 62340 

ctcaggccag agaaggacaa ggcagccacc agagtggatg ccatctgtac ccaccaccct 62400 

ga' cctcaaa gccctggact gaacagagag cagctgtact gggagctgag ccagctgacc 62460 

cacggcatca ctgagctggg cccctacacc ctggacaggg acagtctcta tgtcgatggt 62520 

ttcactcatt ggagccccat accgaccacc agcactcctg ggacctccat agtgaacctg 62580 

ggaacctctg ggatcccacc ttccctccct gaaactacan cnnctgnccc tctcctgntn 62640 

ccnttcaccn tcaacttnac catcaccaac ctgcantang nggannacat gcnncncccn 62700 

ggntccagga agttcaacac cacngagagg gttctgcagg gtctgctcaa gcccttgttc 62760 

aagagcacca gtgttggccc tctgtattct ggctgcagac tgaccttgct caggcctgag 62820 

aaggacggag tagccaccag agtggacgcc atctgcaccc accgccctga ccccaaaatc 62880 

cctgggctag acagacagca gctatactgg gagctgagcc agctgaccca cagcatcact 62940 

gagctgggac cctacaccct ggatagggac agtctctatg tcaatggttt cacccagcgg 63000 

agctctgtgc ccaccaccag cactcctggg actttcacag tacagccgga aacctctgag 63060 

actccatcat ccctccctgg ccccacagcc actggccctg tcctgctgcc attcaccctc 63120 

aattttacca tcactaacct gcagtatgag gaggacatgc atcgccctgg ctccaggaag 63180 
ttcaacacca cggagagggt ccttcagggt ctgcttatgc ccttgttcaa gaacaccagt 63240

gtcagctctc tgtactctgg ttgcagactg accttgctca ggcctgagaa ggatggggca 63300 

gccaccagag tggatgctgt ctgcacccat cgtcctgacc ccaaaagccc tggactggac 63360 

agagagcggc tgtactggaa gctgagccag ctgacccacg gcatcactga gctgggcccc 63420 

tacaccctgg acaggcacag tctctatgtc aatggtttca cccatcagag ctctatgacg 63480 

accaccagaa ctcctgatac ctccacaatg cacctggcaa cctcgagaac tccagcctcc 63540 

ctgtctggac ctacgaccgc cagccctctc ctggtgctat tcacaattaa cttcaccatc 63600 

actaacctgc ggtatgagga gaacatgcat caccctggct ctagaaagtt taacaccacg 63660 

gagagagtcc ttcagggtct gctcaggcct gtgttcaaga acaccagtgt tggccctctg 63720 

tactctggct gcagactgac cttgctcagg cccaagaagg atggggcagc caccaaagtg 63780 

gatgccatct gcacctaccg ccctgatccc aaaagccctg gactggacag agagcagcta 63840 

tactgggagc tgagccagct aacccacagc atcactgagc tgggccccta caccctggac 63900 

agggacagtc tctatgtcaa tggtttcaca cagcggagct ctgtgcccac cactagcatt 63960 

cctgggaccc ccacagtgga cctgggaaca tctgggactc cagtttctaa acctggtccc 64020 

tcggctgcca gccctctcct ggtgctattc actctcaact tcaccatcac caacctgcgg 64080 

tatgaggaga acatgcagca ccctggctcc aggaagttca acaccacgga gagggtcctt 64140 

cagggcctgc tcaggtccct gttcaagagc accagtgttg gccctctgta ctctggctgc 64200 

agactgactt tgctcaggcc tgaaaaggat gggacagcca ctggagtgga tgccatctgc 64260 

acccaccacc ctgaccccaa aagccctagg ctggacagag agcagctgta ttgggagctg 64320 

agccagctga cccacaatat cactgagctg ggccactatg ccctggacaa cgacagcctc 64380 

tttgtcaatg gtttcactca tcggagctct gtgtccacca ccagcactcc tgggaccccc 64440 

acagtgtatc tgggagcatc taagactcca gcctcgatat ttggcccttc agctgccagc 64500 

catctcctga tactattcac cctcaacttc accatcacta acctgcggta tgaggagaac 64560 

atgtggcctg gctccaggaa gttcaacact acagagaggg tccttcaggg cctgctaagg 64620 

cccttgttca agaacaccag tgttggccct ctgtactctg gctccaggct gaccttgctc 64680 

aggccagaga aagatgggga agccaccgga gtggatgcca tctgcaccca ccgccctgac 64740 

cccacaggcc ctgggctgga cagagagcag ctgtatttgg agctgagcca gctgacccac 64800 

agcatcactg agctgggccc ctacacactg gacagggaca gtctctatgt caatggtttc 64860 

acccatcgga gctctgtacc caccaccagc accggggtgg tcagcgagga gccattcaca 64920 

ctgaacttca ccatcaac^a cctgcgctac atggcggaca tgggccaacc cggctccctc 64980 

aagttcaaca tcacagacaa cgtcatgaag cacctgctca gtcctttgtt ccagaggagc 65040 
agcctgggtg cacggtacac aggctgcagg gtcatcgcac taaggtctgt gaagaacggt 65100

gctgagacac gggtggacct cctctgcacc tacctgcagc ccctcagcgg cccaggtctg 65160 

cctatcaagc aggtgttcca tgagctgagc cagcagaccc atggcatcac ccggctgggc 65220 

ccctactctc tggacaaaga cagcctctac cttaacggtt acaatgaacc tggtctagat 65280 

gagcctccta caactcccaa gccagccacc acattcctgc ctcctctgtc agaagccaca 65340 

acagccatgg ggtaccacct gaagaccctc acactcaact tcaccatctc caatctccag 65400 

tattcaccag atatgggcaa gggctcagct acattcaact ccaccgaggg ggtccttcag 65460 

cacctgctca gacccttgtt ccagaagagc agcatgggcc ccttctactt gggttgccaa 65520 

ctgatctccc tcaggcctga gaaggatggg gcagccactg gtgtggacac cacctgcacc 65580 

taccaccctg accctgtggg ccccgggctg gacatacagc agctttactg ggagctgagt 65640 

cagctgaccc atggtgtcac ccaactgggc ttctatgtcc tggacaggga tagcctcttc 65700 

atcaatggct atgcacccca gaatttatca atccggggcg agtaccagat aaatttccac 65760 

attgtcaact ggaacctcag taatccagac cccacatcct cagagtacat caccctgctg 65820 

agggacatcc aggacaaggt caccacactc tacaaaggca gtcaactaca tgacacattc 65880

cgcttctgcc tggtcaccaa cttgacgatg gactccgtgt tggtcactgt caaggcattg 65940 

ttctcctcca atttggaccc cagcctggtg gagcaagtct ttctagataa gaccctgaat 66000 

gcctcattcc attggctggg ctccacctac cagttggtgg acatccatgt gacagaaatg 66060 

gagtcatcag tttatcaacc aacaagcagc tccagcaccc agcacttcta cctgaatttc 66120 

accatcacca acctaccata ttcccaggac aaagcccagc caggcaccac caattaccag 66180 

aggaacaaaa ggaatattga ggatgcgctc aaccaactct tccgaaacag cagcatcaag 66240 

agttattttt ctgactgtca agtttcaaca ttcaggtctg tccccaacag gcaccacacc 66300 

ggggtggact ccctgtgtaa cttctcgcca ctggctcgga gagtagacag agttgccatc 66360 

tatgaggaat ttctgcggat gacccggaat ggtacccagc tgcagaactt caccctggac 66420 

aggagcagtg tccttgtgga tgggtattct cccaacagaa atgagccctt aactgggaat 66480 

tctgaccttc ccttctgggc tgtcatcctc atcggcttgg caggactcct gggactcatc 66540 

acatgcctga tctgcggtgt cctggtgacc acccgccggc ggaagaagga aggagaatac 66600 

aacgtccagc aacagtgccc aggctactac cagtcacacc tagacctgga ggatctgcaa 66660 

tgactggaac ttgccggtgc ctggggtgcc tttcccccag ccagggtcca aagaagcttg 66720 

gctggggcag aaataaacca tattggtcgg aaaaaaaaaa aaaaa 66765 

<210> 5 

<211> 22152 

<212> PRT 

<213> Homo sapiens 
<220>
<221> misc_feature
<222> (13880)..(13880)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13887)..(13887)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13890)..(13891)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13893)..(13893)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13899)..(13899)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13903)..(13903)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13911)..(13918)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13926)..(13926)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13928)..(13928)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13930)..(13930)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13936)..(13938)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (13940)..(13941)
<223> Xaa can be any naturally occurring amino acid

<220> 
<222> (14569)..(14571)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14575)..(14575)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14587)..(14587)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14591)..(14591)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14593)..(14594)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14725)..(14727)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14731)..(14731)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14743)..(14743)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14747)..(14747)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (14749)..(14750)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15661)..(15663)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15667)..(15667)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15679)..(15679)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (15683)..(15683)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15685)..(15686)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15972)..(15974)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15978)..(15978)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15990)..(15990)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15994)..(15994)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (15996)..(15997)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16015)..(16015)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16017)..(16017)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16020)..(16020)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16025)..(16025)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16037)..(16037)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature

<220>
<221> misc_feature
<222> (16040)..(16041)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16045)..(16045)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16048)..(16049)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16051)..(16054)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16057)..(16058)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16065)..(16065)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16072)..(16072)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16075)..(16076)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16078)..(16078)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16084)..(16084)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16088)..(16088)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16376)..(16376)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (16383)..(16383)
<223> Xaa can be any naturally occurring amino acid
Glu Leu Thr Ser Ser Gln Thr Ile Ala Glu Glu Glu Gly Thr Thr
3095 3100 3105

Ala Glu Thr Gln Thr Leu Thr Phe Thr Pro Ser Glu Thr Pro Thr
3110 3115 3120

Ser Leu Leu Pro Val Ser Ser Pro Thr Glu Pro Thr Ala Arg Arg
3125 3130 3135

Ser Ser Pro Glu Thr Trp Ala Ser Ser Ile Ser Val Pro Ala
3140 3145 3150

Thr Ser Leu Val Glu Thr Thr Asp Gly Thr Leu Val Thr Thr
3155 3160 3165

Ile Lys Met Ser Ser Gln Ala Ala Gln Gly Asn Ser Thr Trp Pro
3170 3175 3180

Ala Pro Ala Glu Glu Thr Gly Thr Ser Pro Ala Gly Thr Ser Pro
3185 3190 3195

Ser Pro Glu Val Ser Thr Thr Leu Lys Ile Met Ser Ser Lys
3200 3205 3210

Glu Pro Ser Ile Ser Pro Glu Ile Arg Ser Thr Val Arg Asn Ser
3215 3220 3225

Pro Trp Lys Thr Pro Glu Thr Thr Val Pro Met Glu Thr Thr Val
3230 3235 3240

Glu Pro Val Thr Leu Gln Ser Thr Ala Leu Gly Ser Gly Ser Thr
3245 3250 3255

Ser Ile Ser His Leu Pro Thr Gly Thr Thr Ser Pro Thr Lys Ser
3260 3265 3270

Pro Thr Glu Asn Met Leu Ala Thr Glu Arg Val Ser Leu Ser Pro
3275 3280 3285

Ser Pro Pro Glu Ala Trp Thr Asn Leu Tyr Ser Gly Thr Pro Gly
3290 3295 3300

Gly Thr Arg Gln Ser Leu Ala Thr Met Ser Ser Val Ser Leu Glu
3305 3310 3315
Ser Ala Glu Ser Gly Leu Leu Thr Pro Val Ser Ala Ser Ser Ser
2855 2860 2865

Pro Ser Lys Ala Phe Ala Ser Leu Thr Thr Ala Pro Pro Ser Thr
2870 2875 2880

Trp Gly Ile Pro Gln Ser Thr Leu Thr Phe Glu Phe Ser Glu Val
2885 2890 2895

Pro Ser Leu Asp Thr Lys Ser Ala Ser Leu Pro Thr Pro Gly Gln
2900 2905 2910

Ser Leu Asn Thr Ile Pro Asp Ser Asp Ala Ser Thr Ala Ser Ser
2915 2920 2925

Ser Leu Ser Lys Ser Pro Glu Lys Asn Pro Arg Ala Arg Met Met
2930 2935 2940

Thr Ser Thr Lys Ala Ile Ser Ala Ser Ser Phe Gln Ser Thr Gly
2945 2950 2955

Phe Thr Glu Thr Pro Glu Gly Ser Ala Ser Pro Ser Met Ala Gly
2960 2965 2970

His Glu Pro Arg Val Pro Thr Ser Gly Thr Gly Asp Pro Arg Tyr
2975 2980 2985

Ala Ser Glu Ser Met Ser Tyr Pro Asp Pro Ser Lys Ala Ser Ser
2990 2995 3000

Ala Met Thr Ser Thr Ser Leu Ala Ser Lys Leu Thr Thr Leu Phe
3005 3010 3015

Ser Thr Gly Gln Ala Ala Arg Ser Gly Ser Ser Ser Ser Pro Ile
3020 3025 3030

Ser Leu Ser Thr Glu Lys Glu Thr Ser Phe Leu Ser Pro Thr Ala
3035 3040 3045

Ser Thr Ser Arg Lys Thr Ser Leu Phe Leu Gly Pro Ser Met Ala
3050 3055 3060

Arg Gln Pro Asn Ile Leu Val His Leu Gln Thr Ser Ala Leu Thr
3065 3070 3075

Leu Ser Pro Thr Ser Thr Leu Asn Met Ser Gln Glu Glu Pro Pro



Thr Asp Thr Ile Leu Phe Ser Thr Val Ser Thr Pro Pro Ser Lys
2615 2620 2625

Phe Pro Ser Thr Gly Thr Leu Ser Gly Ala Ser Phe Pro Thr Leu
2630 2635 2640

Leu Pro Asp Thr Pro Ala Ile Pro Leu Thr Ala Thr Glu Pro Thr
2645 2650 2655

Ser Ser Leu Ala Thr Ser Phe Asp Ser Thr Pro Leu Val Thr Ile
2660 2665 2670

Ala Ser Asp Ser Leu Gly Thr Val Pro Glu Thr Thr Leu Thr Met
2675 2680 2685

Ser Glu Thr Ser Asn Gly Asp Ala Leu Val Leu Lys Thr Val Ser
2690 2695 2700

Asn Pro Asp Arg Ser Ile Pro Gly Ile Thr Ile Gln Gly Val Thr
2705 2710 2715

Glu Ser Pro Leu His Pro Ser Ser Thr Ser Pro Ser Lys Ile Val
2720 2725 2730

Ala Pro Arg Asn Thr Thr Tyr Glu Gly Ser Ile Thr Val Ala Leu
2735 2740 2745

Ser Thr Leu Pro Ala Gly Thr Thr Gly Ser Leu Val Phe Ser Gln
2750 2755 2760

Ser Ser Glu Asn Ser Glu Thr Thr Ala Leu Val Asp Ser Ser Ala
2765 2770 2775

Leu Glu Arg Ala Ser Val Met Pro Leu Thr Thr Gly Ser Gln
2780 2785 2790

Gly Met Ala Ser Ser Gly Gly Ile Arg Ser Gly Ser Thr His Ser
2795 2800 2805

Thr Gly Thr Lys Thr Phe Ser Ser Leu Pro Leu Thr Met Asn Pro
2810 2815 2820

Gly Glu Val Thr Ala Met Ser Glu Ile Thr Thr Asn Arg Leu Thr
2825 2830 2835

Ala Thr Gln Ser Thr Ala Pro Lys Gly Ile Pro Val Lys Pro Thr
2840 2845 2850
Ile Asn Thr Ser Ile Ala Ser Ser Thr Met Asp Val Thr Lys Thr
2375 2380 2385

Ala Ser Ile Ser Pro Thr Ser Ile Ser Gly Met Thr Ala Ser Ser
2390 2395 2400

Ser Pro Ser Leu Phe Ser Ser Asp Arg Pro Gln Val Pro Thr Ser
2405 2410 2415

Thr Thr Glu Thr Asn Thr Ala Thr Ser Pro Ser Val Ser Ser Asn
2420 2425 2430

Thr Tyr Ser Leu Asp Gly Gly Ser Asn Val Gly Gly Thr Pro Ser
2435 2440 2445

Thr Leu Pro Pro Phe Thr Ile Thr His Pro Val Glu Thr Ser Ser
2450 2455 2460

Ala Leu Leu Ala Trp Ser Arg Pro Val Arg Thr Phe Ser Thr Met
2465 2470 2475

Val Ser Thr Asp Thr Ala Ser Gly Glu Asn Pro Thr Ser Ser Asn
2480 2485 2490

Ser Val Val Thr Ser Val Pro Ala Pro Gly Thr Trp Ala Ser Val
2495 2500 2505

Gly Ser Thr Thr Asp Leu Pro Ala Met Gly Phe Leu Lys Thr Ser
2510 2515 2520

Pro Ala Gly Glu Ala His Ser Leu Leu Ala Ser Thr Ile Glu Pro
2525 2530 2535

Ala Thr Ala Phe Thr Pro His Leu Ser Ala Ala Val Val Thr Gly
2540 2545 2550

Ser Ser Ala Thr Ser Glu Ala Ser Leu Leu Thr Thr Ser Glu Ser
2555 2560 2565

Lys Ala Ile His Ser Ser Pro Gln Thr Pro Thr Thr Pro Thr Ser
2570 2575 2580

Gly Ala Asn Trp Glu Thr Ser Ala Thr Pro Glu Ser Leu Leu Val
2585 2590 2595

Val Thr Glu Thr Ser Asp Thr Thr Leu Thr Ser Lys Ile Leu Val
2600 2605 2610
2135 2140 2145

Ala Ile Thr Ser Arg Ser Asp Val Ser Gly Leu Thr Ser Glu Ser
2150 2155 2160

Thr Ala Asn Pro Ser Leu Gly Thr Ala Ser Ser Ala Gly Thr Lys
2165 2170 2175

Leu Thr Arg Thr Ile Ser Leu Pro Thr Ser Glu Ser Leu Val Ser
2180 2185 2190

Phe Arg Met Asn Lys Asp Pro Trp Thr Val Ser Ile Pro Leu Gly
2195 2200 2205

Ser His Pro Thr Thr Asn Thr Glu Thr Ser Ile Pro Val Asn Ser
2210 2215 2220

Ala Gly Pro Pro Gly Leu Ser Thr Val Ala Ser Asp Val Ile Asp
2225 2230 2235

Thr Pro Ser Asp Gly Ala Glu Ser Ile Pro Thr Val Ser Phe Ser
2240 2245 2250

Pro Ser Pro Asp Thr Glu Val Thr Thr Ile Ser His Phe Pro Glu
2255 2260 2265

Lys Thr Thr His Ser Phe Arg Thr Ile Ser Ser Leu Thr His Glu
2270 2275 2280

Leu Thr Ser Arg Val Thr Pro Ile Pro Gly Asp Trp Met Ser Ser
2285 2290 2295

Ala Met Ser Thr Lys Pro Thr Gly Ala Ser Pro Ser Ile Thr Leu
2300 2305 2310

Gly Glu Arg Arg Thr Ile Thr Ser Ala Ala Pro Thr Thr Ser Pro
2315 2320 2325

Ile Val Leu Thr Ala Ser Phe Thr Glu Thr Ser Thr Val Ser Leu
2330 2335 2340

Asp Asn Glu Thr Thr Val Lys Thr Ser Asp Ile Leu Asp Ala Arg
2345 2350 2355

Lys Thr Asn Glu Leu Pro Ser Asp Ser Ser Ser Ser Ser Asp Leu
2360 2365 2370
His Pro Arg Pro Glu Thr Ser Ala Pro Gly Ala Glu Thr Thr Leu
1910 1915 1920

Thr Ser Thr Pro Gly Asn Arg Ala Ile Ser Leu Thr Leu Pro Phe
1925 1930 1935

Ser Ser Ile Pro Val Glu Glu Val Ile Ser Thr Gly Ile Thr Ser
1940 1945 1950

Gly Pro Asp Ile Asn Ser Ala Pro Met Thr His Ser Pro Ile Thr
1955 1960 1965

Pro Pro Thr Ile Val Trp Thr Ser Thr Gly Thr Ile Glu Gln Ser
1970 1975 1980

Thr Gln Pro Leu His Ala Val Ser Ser Glu Lys Val Ser Val Gln
1985 1990 1995

Thr Gln Ser Thr Pro Tyr Val Asn Ser Val Ala Val Ser Ala Ser
2000 2005 2010

Pro Thr His Glu Asn Ser Val Ser Ser Gly Ser Ser Thr Ser Ser
2015 2020 2025

Pro Tyr Ser Ser Ala Ser Leu Glu Ser Leu Asp Ser Thr Ile Ser
2030 2035 2040

Arg Arg Asn Ala Ile Thr Ser Trp Leu Trp Asp Leu Thr Thr Ser
2045 2050 2055

Leu Pro Thr Thr Thr Trp Pro Ser Thr Ser Leu Ser Glu Ala Leu
2060 2065 2070

Ser Ser Gly His Ser Gly Val Ser Asn Pro Ser Ser Thr Thr Thr
2075 2080 2085

Glu Phe Pro Leu Phe Ser Ala Ala Ser Thr Ser Ala Ala Lys Gln
2090 2095 2100

Arg Asn Pro Glu Thr Glu Thr His Gly Pro Gln Asn Thr Ala Ala
2105 2110 2115

Ser Thr Leu Asn Thr Asp Ala Ser Ser Val Thr Gly Leu Ser Glu
2120 2125 2130

Thr Pro Val Gly Ala Ser Ile Ser Ser Glu Val Pro Leu Pro Met
Ser Ser Thr Ala Thr Val Ser Met Ala Gly Thr Met Gly Leu
1670 1675 1680

Leu Val Thr Ser Ala Pro Gly Arg Ser Ile Ser Gln Ser Leu Gly
1685 1690 1695

Val Ser Ser Val Leu Ser Glu Ser Thr Thr Glu Gly Val Thr
1700 1705 1710

Asp Ser Ser Lys Gly Ser Ser Pro Arg Leu Asn Thr Gln Gly Asn
1715 1720 1725

Thr Ala Leu Ser Ser Ser Leu Glu Pro Ser Tyr Ala Glu Gly Ser
1730 1735 1740

Gln Met Ser Thr Ser Ile Pro Leu Thr Ser Ser Pro Thr Thr Pro
1745 1750 1755

Asp Val Glu Phe Ile Gly Gly Ser Thr Phe Trp Thr Lys Glu Val
1760 1765 1770

Thr Thr Val Met Thr Ser Asp Ile Ser Lys Ser Ser Ala Arg Thr
1775 1780 1785

Glu Ser Ser Ser Ala Thr Leu Met Ser Thr Ala Leu Gly Ser Thr
1790 1795 1800

Glu Asn Thr Gly Lys Glu Lys Leu Arg Thr Ala Ser Met Asp Leu
1805 1810 1815

Pro Ser Pro Thr Pro Ser Met Glu Val Thr Pro Trp Ile Ser Leu
1820 1825 1830

Thr Leu Ser Asn Ala Pro Asn Thr Thr Asp Ser Leu Asp Leu Ser
1835 1840 1845

His Gly Val His Thr Ser Ser Ala Gly Thr Leu Ala Thr Asp Arg
1850 1855 1860

Ser Leu Asn Thr Gly Val Thr Arg Ala Ser Arg Leu Glu Asn Gly
1865 1870 1875

Ser Asp Thr Ser Ser Lys Ser Leu Ser Met Gly Asn Ser Thr His
1880 1885 1890

Thr Ser Met Thr Asp Thr Glu Lys Ser Glu Val Ser Ser Ser Ile
1895 1900 1905
Ser Ser Thr Leu Glu Leu Thr Ser Asp Val Pro Gly Glu Pro Leu
1430 1435 1440

Val Leu Ala Pro Ser Glu Gln Thr Thr Ile Thr Leu Pro Met Ala
1445 1450 1455

Thr Trp Leu Ser Thr Ser Leu Thr Glu Glu Met Ala Ser Thr Asp
1460 1465 1470

Leu Asp Ile Ser Ser Pro Ser Ser Pro Met Ser Thr Phe Ala Ile
1475 1480 1485

Phe Pro Pro Met Ser Thr Pro Ser His Glu Leu Ser Lys Ser Glu
1490 1495 1500

Ala Asp Thr Ser Ala Ile Arg Asn Thr Asp Ser Thr Thr Leu Asp
1505 1510 1515

Gln His Leu Gly Ile Arg Ser Leu Gly Arg Thr Gly Asp Leu Thr
1520 1525 1530

Thr Val Pro Ile Thr Pro Leu Thr Thr Thr Trp Thr Ser Val Ile
1535 1540 1545

Glu His Ser Thr Gln Ala Gln Asp Thr Leu Ser Ala Thr Met Ser
1550 1555 1560

Pro Thr His Val Thr Gln Ser Leu Lys Asp Gln Thr Ser Ile Pro
1565 1570 1575

Ala Ser Ala Ser Pro Ser His Leu Thr Glu Val Tyr Pro Glu Leu
1580 1585 1590

Gly Thr Gln Gly Arg Ser Ser Ser Glu Ala Thr Thr Phe Trp Lys
1595 1600 1605

Pro Ser Thr Asp Thr Leu Ser Arg Glu Ile Glu Thr Gly Pro Thr
1610 1615 1620

Asn Ile Gln Ser Thr Pro Pro Met Asp Asn Thr Thr Thr Gly Ser
1625 1630 1635

Ser Ser Ser Gly Val Thr Leu Gly Ile Ala His Leu Pro Ile Gly
1640 1645 1650

Thr Ser Ser Pro Ala Glu Thr Ser Thr Asn Met Ala Leu Glu Arg
1655 1660 1665
Ser Asn Ile Pro Thr Ser Gly Ala Ile Gly Lys Ser Thr Leu Val
1205 1210 1215

Pro Leu Asp Thr Pro Ser Pro Ala Thr Ser Leu Glu Ala Ser Glu
1220 1225 1230

Gly Leu Pro Thr Leu Ser Thr Tyr Pro Glu Ser Thr Asn Thr
1235 1240 1245

Pro Ser Ile His Leu Gly Ala His Ala Ser Ser Glu Ser Pro Ser
1250 1255 1260

Thr Ile Asn Leu Thr Met Ala Ser Val Val Lys Pro Gly Ser Tyr
1265 1270 1275

Thr Pro Leu Thr Phe Pro Ser Ile Glu Thr His Ile His Val Ser
1280 1285 1290

Thr Ala Arg Met Ala Tyr Ser Ser Gly Ser Ser Pro Glu Met Thr
1295 1300 1305

Ala Pro Gly Glu Thr Asn Thr Gly Ser Thr Trp Asp Pro Thr Thr
1310 1315 1320

Ile Thr Thr Thr Asp Pro Lys Asp Thr Ser Ser Ala Gln Val
1325 1330 1335

Ser Thr Pro His Ser Val Arg Thr Leu Arg Thr Thr Glu Asn His
1340 1345 1350

Pro Lys Thr Glu Ser Ala Thr Pro Ala Ala Tyr Ser Gly Ser Pro
1355 1360 1365

Ile Ser Ser Ser Pro Asn Leu Thr Ser Pro Ala Thr Lys Ala
1370 1375 1380

Trp Thr Ile Thr Asp Thr Thr Glu His Ser Thr Gln Leu His Tyr
1385 1390 1395

Thr Lys Leu Ala Glu Lys Ser Ser Gly Phe Glu Thr Gln Ser Ala
1400 1405 1410

Pro Gly Pro Val Ser Val Val Ile Pro Thr Ser Pro Thr Ile Gly
1415 1420 1425
Ala Pro Gln Ser Thr Thr Trp Pro Glu Thr Ser Pro Arg Phe Lys Thr
965 970 975

Gly Leu Pro Ser Ala Thr Thr Thr Val Ser Thr Ser Ala Thr Ser Leu
980 985 990

Ser Ala Thr Val Met Val Ser Lys Phe Thr Ser Pro Ala Thr Ser Ser
995 1000 1005

91^^ Arg Glu Pro Ser Thr Thr Ile Leu Thr
1010 1015 1020

5l!ir "^^^ ^'^y Met Ala Val Ala Ser Thr
1025 1030 1035

Jl?^ Gly Thr Glu Gly Arg Leu Asp
1040 1045 1050

Thr Ser His Leu Pro Ile Gly Thr Thr Ala Ser Ser Glu Thr Ser
1055 1060 1065

^12^ '^^^ Met Ala Lys Glu Ser Val Ser Met Ser Val Ser
1070 1075 1080

fSL M" Asp Ala Ala Gly Ser Ser Thr Pro Gly Arg
1085 1090 1095

Thr Ser Gln Phe Val Asp Thr Phe Ser Asp Asp Val Tyr His Leu
1100 1105 1110

Thr Ser Arg Glu Ile Thr Ile Pro Arg Asp Gly Thr Ser Ser Ala
1115 1120 1125

Leu Thr Pro Gln Met Thr Ala Thr His Pro Pro Ser Pro Asp Pro
1130 1135 1140

Gly f?"!. Ala Arg Ser Thr Trp Leu Gly Ile Leu Ser Ser Ser Pro
1145 1150 1155

Ser Ser Pro Thr Pro Lys Val Thr Met Ser Ser Thr Phe Ser Thr
1160 1165 1170

Gln Arg Val Thr Thr Ser Met Ile Met Asp Thr Val Glu Thr Ser
1175 1180 1185

Arg Trp Asn Met Pro Asn Leu Pro Ser Thr Thr Ser Leu Thr Pro
His Pro Phe Ser Ser Pro Glu Pro Asp Ser Ala Gly His Thr Lys Ile
705 710 715 720

Ser Thr Ser Ile Pro Leu Leu Ser Ser Ala Ser Val Leu Glu Asp Lys
725 730 735

Val Ser Ala Thr Ser Thr Phe Ser His His Lys Ala Thr Ser Ser Ile
740 745 750

Thr Thr Gly Thr Pro Glu Ile Ser Thr Lys Thr Lys Pro Ser Ser Ala
755 760 765

Val Leu Ser Ser Met Thr Leu Ser Asn Ala Ala Thr Ser Pro Glu Arg
770 775 780

Val Arg Asn Ala Thr Ser Pro Leu Thr His Pro Ser Pro Ser Gly Glu
785 790 795 800

Glu Thr Ala Gly Ser Val Leu Thr Leu Ser Thr Ser Ala Glu Thr Thr
805 810 815

Ser Pro Asn Ile His Pro Thr Gly Thr Leu Thr Ser Glu Ser Ser
820 825 830

Glu Ser Pro Ser Thr Leu Ser Leu Pro Ser Val Ser Gly Val Lys Thr
835 840 845

Thr Phe Ser Ser Ser Thr Pro Ser Thr His Leu Phe Thr Ser Gly Glu
850 855 860

Glu Thr Glu Glu Thr Ser Asn Pro Ser Val Ser Gln Pro Glu Thr Ser
865 870 875 880

Val Ser Arg Val Arg Thr Thr Leu Ala Ser Thr Ser Val Pro Thr Pro
885 890 895

Val Phe Pro Thr Met Asp Thr Trp Pro Thr Arg Ser Ala Gln Phe Ser
900 905 910

Ser Ser His Leu Val Ser Glu Leu Arg Ala Thr Ser Ser Thr Ser Val
915 920 925

Thr Asn Ser Thr Gly Ser Ala Leu Pro Lys Ile Ser His Leu Thr Gly
930 935 940

Thr Ala Thr Met Ser Gln Thr Asn Arg Asp Thr Phe Asn Asp Ser Ala
945 950 955
Ser Glu Met Thr Ala Thr Leu Val Pro Thr Leu Gly Phe Thr Thr Leu
450 455 460

Asp Ser Lys Ile Arg Ser Pro Ser Gln Val Ser Ser Ser His Pro Thr
465 470 475 480

Arg Glu Leu Arg Thr Thr Gly Ser Thr Ser Gly Arg Gln Ser Ser Ser
485 490 495

Thr Ala Ala His Gly Ser Ser Asp Ile Leu Arg Ala Thr Thr Ser Ser
500 505 510

Thr Ser Lys Ala Ser Ser Trp Thr Ser Glu Ser Thr Ala Gln Gln Phe
515 520 525

Ser Glu Pro Gln His Thr Gln Trp Val Glu Thr Ser Pro Ser Met Lys
530 535 540

Thr Glu Arg Pro Pro Ala Ser Thr Ser Val Ala Ala Pro Ile Thr Thr
545 550 555 560

Ser Val Pro Ser Val Val Ser Gly Phe Thr Thr Leu Lys Thr Ser Ser
565 570 575

Thr Lys Gly Ile Trp Leu Glu Glu Thr Ser Ala Asp Thr Leu Ile Gly
580 585 590

Glu Ser Thr Ala Gly Pro Thr Thr His Gln Phe Ala Val Pro Thr Gly
595 600 605

Ile Ser Met Thr Gly Gly Ser Ser Thr Arg Gly Ser Gln Gly Thr Thr
610 615 620

His Leu Leu Thr Arg Ala Thr Ala Ser Ser Glu Thr Ser Ala Asp Leu
625 630 635 640

Thr Leu Ala Thr Asn Gly Val Pro Val Ser Val Ser Pro Ala Val Ser
645 650 655

Lys Thr Ala Ala Gly Ser Ser Pro Pro Gly Gly Thr Lys Pro Ser Tyr
660 665 670

Thr Met Val Ser Ser Val Ile Pro Glu Thr Ser Ser Leu Gln Ser Ser
675 680 685

Ala Phe Arg Glu Gly Thr Ser Leu Gly Leu Thr Pro Leu Asn Thr Arg
690 695 700
Ser Met Thr Pro Ala Glu Thr Thr Val Thr Asp Ser His Thr Pro Gly
210 215 220

Arg Thr Asn Pro Ser Phe Gly Thr Leu Tyr Ser Ser Phe Leu Asp Leu
225 230 235 240

Ser Pro Lys Gly Thr Pro Asn Ser Arg Gly Glu Thr Ser Leu Glu Leu
245 250 255

Ile Leu Ser Thr Thr Gly Tyr Pro Phe Ser Ser Pro Glu Pro Gly Ser
260 265 270

Ala Gly His Ser Arg Ile Ser Thr Ser Ala Pro Leu Ser Ser Ser Ala
275 280 285

Ser Val Leu Asp Asn Lys Ile Ser Glu Thr Ser Ile Phe Ser Gly Gln
290 295 300

Ser Leu Thr Ser Pro Leu Ser Pro Gly Val Pro Glu Ala Arg Ala Ser
305 310 315 320

Thr Met Pro Asn Ser Ala Ile Pro Phe Ser Met Thr Leu Ser Asn Ala
325 330 335

Glu Thr Ser Ala Glu Arg Val Arg Ser Thr Ile Ser Ser Leu Gly Thr
340 345 350

Pro Ser Ile Ser Thr Lys Gln Thr Ala Glu Thr Ile Leu Thr Phe His
355 360 365

Ala Phe Ala Glu Thr Met Asp Ile Pro Ser Thr His Ile Ala Lys Thr
370 375 380

Leu Ala Ser Glu Trp Leu Gly Ser Pro Gly Thr Leu Gly Gly Thr Ser
385 390 395 400

Thr Ser Ala Leu Thr Thr Thr Ser Pro Ser Thr Thr Leu Val Ser Glu
405 410 415

Glu Thr Asn Thr His His Ser Thr Ser Gly Lys Glu Thr Glu Gly Thr
420 425 430

Leu Asn Thr Ser Met Thr Pro Leu Glu Thr Ser Ala Pro Gly Glu Glu
435 440 445
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<223> Xaa can be any naturally occurring amino acid 

<220>
<221> misc_feature
<222> (20828)..(20828)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20830)..(20831)
<223> Xaa can be any naturally occurring amino acid
<400> 5 

Met Leu Lys Pro Ser Gly Leu Pro Gly Ser Ser Ser Pro Thr Arg Ser
1 5 10 15

Leu Met Thr Gly Ser Arg Ser Thr Lys Ala Thr Pro Glu Met Asp Ser
20 25 30

Gly Leu Thr Gly Ala Thr Leu Ser Pro Lys Thr Ser Thr Gly Ala Ile
35 40 45

Val Val Thr Glu His Thr Leu Pro Phe Thr Ser Pro Asp Lys Thr Leu
50 55 60

Ala Ser Pro Thr Ser Ser Val Val Gly Arg Thr Thr Gln Ser Leu Gly
65 70 75 80

Val Met Ser Ser Ala Leu Pro Glu Ser Thr Ser Arg Gly Met Thr His
85 90 95

Ser Glu Gln Arg Thr Ser Pro Ser Leu Ser Pro Gln Val Asn Gly Thr
100 105 110

Pro Ser Arg Asn Tyr Pro Ala Thr Ser Met Val Ser Gly Leu Ser Ser
115 120 125

Pro Arg Thr Arg Thr Ser Ser Thr Glu Gly Asn Phe Thr Lys Glu Ala
130 135 140

Ser Thr Tyr Thr Leu Thr Val Glu Thr Thr Ser Gly Pro Val Thr Glu
145 150 155 160

Lys Tyr Thr Val Pro Thr Glu Thr Ser Thr Thr Glu Gly Asp Ser Thr
165 170 175

Glu Thr Pro Trp Asp Thr Arg Tyr Ile Pro Val Lys Ile Thr Ser Pro
180 185 190

Met Lys Thr Phe Ala Asp Ser Thr Ala Ser Lys Glu Asn Ala Pro Val
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<220> 

<221> misc_feature 

<222> (20285) . . (20286) 

<223> xaa can be any naturally occurring amino acid 
<220>
<221> misc_feature
<222> (20288)..(20288)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20294)..(20294)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20298)..(20298)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20306)..(20313)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20321)..(20321)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20323)..(20323)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20325)..(20325)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20331)..(20333)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20335)..(20336)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20806)..(20808)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20812)..(20812)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20824)..(20824)
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<222> (20094) . . (20095) 

<223> Xaa can be any naturally occurring amino acid 
<220>
<221> misc_feature
<222> (20099)..(20099)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20102)..(20103)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20105)..(20108)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20111)..(20112)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20119)..(20119)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20126)..(20126)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20129)..(20130)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20132)..(20132)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20138)..(20138)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20142)..(20142)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20275)..(20275)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20282)..(20282)
<223> Xaa can be any naturally occurring amino acid
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<220> 

<221> misc_feature
<222> (20019)..(20021)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20023)..(20024)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20026)..(20028)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20032)..(20032)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20044)..(20044)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20048)..(20048)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20050)..(20051)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20069)..(20069)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20071)..(20071)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20074)..(20074)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20079)..(20079)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20091)..(20091)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
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<221> misc_feature
<222> (19821)..(19821)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19827)..(19827)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19831)..(19831)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19963)..(19963)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19970)..(19970)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19973)..(19974)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19976)..(19976)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19982)..(19982)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19986)..(19986)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19994)..(20001)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20009)..(20009)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20011)..(20011)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (20013)..(20013)
<223> Xaa can be any naturally occurring amino acid
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<223> xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (19760) . . (19760) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19763) . . (19763) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (19768) . . (19768) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19780) . . (19780) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (19783) . . (19784) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19788)..(19788)

<223> Xaa can be any naturally occurring amino acid

<220> 

<221> misc_feature 

<222> (19791) . . (19792) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19794) . . (19797) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19800)..(19801)

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (19808) . . (19808) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19815) . . (19815) 

<223> xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (19818) . . (19819) 

<223> xaa can be any naturally occurring amino acid 



<220> 
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<221> misc^feature 
<222> (19675)..(19675)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19683)..(19690)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19698)..(19698)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19700)..(19700)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19702)..(19702)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19708)..(19710)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19712)..(19713)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19715)..(19717)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19721)..(19721)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19733)..(19733)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19737)..(19737)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19739)..(19740)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19758)..(19758)
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<222> (19482)..(19485)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19488)..(19489)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19496)..(19496)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19503)..(19503)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19506)..(19507)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19509)..(19509)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19515)..(19515)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19519)..(19519)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19652)..(19652)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19659)..(19659)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19662)..(19663)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19665)..(19665)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19671)..(19671)
<223> Xaa can be any naturally occurring amino acid
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<220> 

<221> misc_feature
<222> (19409)..(19409)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19421)..(19421)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19425)..(19425)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19427)..(19428)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19446)..(19446)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19448)..(19448)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19451)..(19451)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19456)..(19456)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19468)..(19468)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19471)..(19472)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19476)..(19476)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19479)..(19480)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
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<221> misc_feature
<222> (19340)..(19340)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19347)..(19347)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19350)..(19351)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19353)..(19353)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19359)..(19359)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19363)..(19363)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19371)..(19378)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19386)..(19386)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19388)..(19388)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19390)..(19390)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19396)..(19398)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19400)..(19401)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19403)..(19405)
<223> Xaa can be any naturally occurring amino acid
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<223> xaa can be any naturally occurring amino acid 

<220>
<221> misc_feature
<222> (19156)..(19156)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19159)..(19160)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19164)..(19164)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19167)..(19168)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19170)..(19173)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19176)..(19177)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19184)..(19184)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19191)..(19191)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19194)..(19195)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19197)..(19197)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19203)..(19203)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (19207)..(19207)
<223> Xaa can be any naturally occurring amino acid

<220>
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<220> 

<221> misc_feature 

<222> (18882) .. (18883) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (18885).. (18885) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (18891)..(18891)

<223> Xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (18895) .. (18895) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19091).. (19093) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (19097) .. (19097) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (19109) .. (19109) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19113)..(19113)

<223> Xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (19115).. (19116) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (19134)..(19134)

<223> Xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (19136) .. (19136) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (19139)..(19139)

<223> Xaa can be any naturally occurring amino acid

<220> 

<221> misc_feature 

<222> (19144).. (19144) 
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<222> (18803)..(18804)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18822)..(18822)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18824)..(18824)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18827)..(18827)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18832)..(18832)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18844)..(18844)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18847)..(18848)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18852)..(18852)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18855)..(18856)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18858)..(18861)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18864)..(18865)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18872)..(18872)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18879)..(18879)
<223> Xaa can be any naturally occurring amino acid
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<220> 

<221> misc_feature
<222> (18735)..(18735)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18739)..(18739)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18747)..(18754)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18762)..(18762)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18764)..(18764)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18766)..(18766)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18772)..(18774)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18776)..(18777)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18779)..(18781)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18785)..(18785)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18797)..(18797)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18801)..(18801)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
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<221> misc_feature 

<222> (18543) . . (18544) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (18546) . . (18549) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18552).. (18553) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18560) . . (18560) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18567).. (18567) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18570) . . (18571) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18573) . . (18573) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18579) . . (18579) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18583).. (18583) 

<223> Xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (18716) . . (18716) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (18723)..(18723)

<223> Xaa can be any naturally occurring amino acid
<220> 

<221> misc_feature

<222> (18726) . . (18727) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (18729) . . (18729) 

<223> xaa can be any naturally occurring amino acid 
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<223> xaa can be any naturally occurring amino acid 

<220>
<221> misc_feature
<222> (18467)..(18469)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18473)..(18473)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18485)..(18485)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18489)..(18489)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18491)..(18492)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18510)..(18510)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18512)..(18512)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18515)..(18515)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18520)..(18520)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18532)..(18532)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18535)..(18536)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18540)..(18540)
<223> Xaa can be any naturally occurring amino acid

<220>
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<220>
<221> misc_feature
<222> (18271)..(18271)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18404)..(18404)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18411)..(18411)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18414)..(18415)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18417)..(18417)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18423)..(18423)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18427)..(18427)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18435)..(18442)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18450)..(18450)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18452)..(18452)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18454)..(18454)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18460)..(18462)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18464)..(18465)
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<222> (18203) .. (18203) 

<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18208)..(18208)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18220)..(18220)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18223)..(18224)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18228)..(18228)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18231)..(18232)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18234)..(18237)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18240)..(18241)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18248)..(18248)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18255)..(18255)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18258)..(18259)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18261)..(18261)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18267)..(18267)
<223> Xaa can be any naturally occurring amino acid
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<220> 

<221> misc_feature 

<222> (18138) . . (18138) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18140) . . (18140) 

<223> xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (18142).. (18142) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature

<222> (18148) . . (18150) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18152) .. (18153) 

<223> xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (18155) .. (18157) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 

<222> (18161).. (18161) 

<223> Xaa can be any naturally occurring amino acid 

<220> 

<221> misc_feature 

<222> (18173).. (18173) 

<223> Xaa can be any naturally occurring amino acid
<220> 

<221> misc_feature 

<222> (18177).. (18177) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature
<222> (18179) . . (18180) 

<223> Xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature
<222> (18198) .. (18198) 

<223> xaa can be any naturally occurring amino acid 
<220> 

<221> misc_feature 
<222> (18200).. (18200) 

<223> Xaa can be any naturally occurring amino acid

<220> 

<221> misc_feature
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<221> misc_feature 

<222> (17936)..(17936)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17943)..(17943)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17946)..(17947)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17949)..(17949)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17955)..(17955)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17959)..(17959)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18092)..(18092)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18099)..(18099)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18102)..(18103)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18105)..(18105)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18111)..(18111)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18115)..(18115)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (18123)..(18130)
<223> Xaa can be any naturally occurring amino acid
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<223> xaa can be any naturally occurring amino acid 

<220>
<221> misc_feature
<222> (17865)..(17865)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17867)..(17868)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17886)..(17886)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17888)..(17888)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17891)..(17891)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17896)..(17896)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17908)..(17908)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17911)..(17912)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17916)..(17916)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17919)..(17920)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17922)..(17925)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17928)..(17929)
<223> Xaa can be any naturally occurring amino acid

<220>
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<220> 

<221> misc_feature
<222> (17790)..(17791)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17793)..(17793)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17799)..(17799)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17803)..(17803)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17811)..(17818)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17826)..(17826)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17828)..(17828)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17830)..(17830)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17836)..(17838)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17840)..(17841)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17843)..(17845)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17849)..(17849)
<223> Xaa can be any naturally occurring amino acid

<220>
<221> misc_feature
<222> (17861)..(17861)
<222> (17576)..(17576)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17579)..(17579)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17584)..(17584)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17596)..(17596)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17599)..(17600)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17604)..(17604)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17607)..(17608)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17610)..(17613)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17616)..(17617)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17624)..(17624)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17631)..(17631)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17780)..(17780)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17787)..(17787)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17499)..(17506)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17514)..(17514)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17516)..(17516)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17518)..(17518)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17524)..(17526)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17528)..(17529)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17531)..(17533)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17537)..(17537)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17549)..(17549)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17553)..(17553)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17555)..(17556)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17574)..(17574)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<221> misc_feature
<222> (16900)..(16902)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16904)..(16905)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16907)..(16909)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16913)..(16913)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16925)..(16925)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16929)..(16929)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16931)..(16932)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17468)..(17468)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17475)..(17475)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17478)..(17479)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17481)..(17481)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17487)..(17487)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (17491)..(17491)
<223> Xaa can be any naturally occurring amino acid
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16461)..(16461)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16463)..(16464)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16844)..(16844)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16851)..(16851)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16854)..(16855)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16857)..(16857)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16863)..(16863)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16867)..(16867)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16875)..(16882)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16890)..(16890)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16892)..(16892)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16894)..(16894)
<223> Xaa can be any naturally occurring amino acid
<220>
<220>
<221> misc_feature
<222> (16386)..(16387)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16389)..(16389)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16395)..(16395)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16399)..(16399)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16407)..(16414)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16422)..(16422)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16424)..(16424)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16426)..(16426)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16432)..(16434)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16436)..(16437)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16439)..(16441)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16445)..(16445)
<223> Xaa can be any naturally occurring amino acid
<220>
<221> misc_feature
<222> (16457)..(16457)
Ser Pro Thr Ala Arg Ser Ile Thr Gly Thr Gly Gln Gln Ser Ser
3320 3325 3330

Pro Glu Leu Val Ser Lys Thr Thr Gly Met Glu Phe Ser Met Trp
3335 3340 3345

His Gly Ser Thr Gly Gly Thr Thr Gly Asp Thr His Val Ser Leu
3350 3355 3360

Ser Thr Ser Ser Asn Ile Leu Glu Asp Pro Val Thr Ser Pro Asn
3365 3370 3375

Ser Val Ser Ser Leu Thr Asp Lys Ser Lys His Lys Thr Glu Thr
3380 3385 3390

Trp Val Ser Thr Thr Ala Ile Pro Ser Thr Val Leu Asn Asn Lys
3395 3400 3405

Ile Met Ala Ala Glu Gln Gln Thr Ser Arg Ser Val Asp Glu Ala
3410 3415 3420

Tyr Ser Ser Thr Ser Ser Trp Ser Asp Gln Thr Ser Gly Ser Asp
3425 3430 3435

Ile Thr Leu Gly Ala Ser Pro Asp Val Thr Asn Thr Leu Tyr Ile
3440 3445 3450

Thr Ser Thr Ala Gln Thr Thr Ser Leu Val Ser Leu Pro Ser Gly
3455 3460 3465

Asp Gln Gly Ile Thr Ser Leu Thr Asn Pro Ser Gly Gly Lys Thr
3470 3475 3480

Ser Ser Ala Ser Ser Val Thr Ser Pro Ser Ile Gly Leu Glu Thr
3485 3490 3495

Leu Arg Ala Asn Val Ser Ala Val Lys Ser Asp Ile Ala Pro Thr
3500 3505 3510

Ala Gly His Leu Ser Gln Thr Ser Ser Pro Ala Glu Val Ser Ile
3515 3520 3525

Leu Asp Val Thr Thr Ala Pro Thr Pro Gly Ile Ser Thr Thr Ile
3530 3535 3540

Thr Thr Met Gly Thr Asn Ser Ile Ser Thr Thr Thr Pro Asn Pro
3545 3550 3555
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Met 
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Asp Ser Thr Pro Ala Thr c 

5 3570 



Glu 



Arg 



Arg Thr Thr Ser Thr Glu His Pro Ser Thr Trp Ser Ser Thr Ala
3575 3580 3585

Ala Ser Asp Ser Trp Thr Val Thr Asp Met Thr Ser Asn Leu
3590 3595 3600

Val Ala Arg Ser Pro Gly Thr Ile Ser Thr Met His Thr Thr Ser
3605 3610 3615

Phe Leu Ala Ser Ser Thr Glu Leu Asp Ser Met Ser Thr Pro His
3620 3625 3630

Gly Arg Ile Thr Val Ile Gly Thr Ser Leu Val Thr Pro Ser Ser
3635 3640 3645

Asp Ala Ser Ala Val Lys Thr Glu Thr Ser Thr Ser Glu Arg Thr
3650 3655 3660

Leu Ser Pro Ser Asp Thr Thr Ala Ser Thr Pro Ile Ser Thr Phe
3665 3670 3675

Ser Arg Val Gln Arg Met Ser Ile Ser Val Pro Asp Ile Leu Ser
3680 3685 3690

Thr Ser Trp Thr Pro Ser Ser Thr Glu Ala Glu Asp Val Pro Val
3695 3700 3705

Ser Met Val Pro Thr Asp His Ala Ser Thr Lys Thr Asp Pro Asn
3710 3715 3720

Thr Pro Leu Ser Thr Phe Leu Phe Asp Ser Leu Ser Thr Leu Asp
3725 3730 3735

Trp Asp Thr Gly Arg Ser Leu Ser Ser Ala Thr Ala Thr Thr Ser
3740 3745 3750

Ala Pro Gln Gly Ala Thr Thr Pro Gln Glu Leu Thr Leu Glu Thr
3755 3760 3765

Met Ile Ser Pro Ala Thr Ser Gln Leu Pro Phe Ser Ile Gly His
3770 3775 3780

Ile Thr Ser Ala Val Thr Pro Ala Ala Met Ala Arg Ser Ser Gly
3785 3790 3795
Val Thr Phe Ser Arg Pro Asp Pro Thr Ser Lys Lys Ala Glu Gln
3800 3805 3810

Thr Ser Thr Gln Leu Pro Thr Thr Thr Ser Ala His Pro Gly Gln
3815 3820 3825

Val Pro Arg Ser Ala Ala Thr Thr Leu Asp Val Ile Pro His Thr
3830 3835 3840

Ala Lys Thr Pro Asp Ala Thr Phe Gln Arg Gln Gly Gln Thr Ala
3845 3850 3855

Leu Thr Thr Glu Ala Arg Ala Thr Ser Asp Ser Trp Asn Glu Lys
3860 3865 3870

Glu Lys Ser Thr Pro Ser Ala Pro Trp Ile Thr Glu Met Met Asn
3875 3880 3885

Ser Val Ser Glu Asp Thr Ile Lys Glu Val Thr Ser Ser Ser Ser
3890 3895 3900

Val Leu Lys Asp Pro Glu Tyr Ala Gly His Lys Leu Gly Ile Trp
3905 3910 3915

Asp Asp Phe Ile Pro Lys Phe Gly Lys Ala Ala His Met Arg Glu
3920 3925 3930

Leu Pro Leu Leu Ser Pro Pro Gln Asp Lys Glu Ala Ile His Pro
3935 3940 3945

Ser Thr Asn Thr Val Glu Thr Thr Gly Trp Val Thr Ser Ser Glu
3950 3955 3960

His Ala Ser His Ser Thr Ile Pro Ala His Ser Ala Ser Ser Lys
3965 3970 3975

Leu Thr Ser Pro Val Val Thr Thr Ser Thr Arg Glu Gln Ala Ile
3980 3985 3990

Val Ser Met Ser Thr Thr Thr Trp Pro Glu Ser Thr Arg Ala Arg
3995 4000 4005

Thr Glu Pro Asn Ser Phe Leu Thr Ile Glu Leu Arg Asp Val Ser
4010 4015 4020

Pro Tyr Met Asp Thr Ser Ser Thr Thr Gln Thr Ser Ile Ile Ser
4025 4030 4035



Ser Pro Gly Ser Thr Ala Ile Thr Lys Gly His Arg Thr Glu Ile
4040 4045 4050

Thr Ser Tyr Lys Arg Ile Ser Ser Ser Phe Leu Ala Gln Ser Met
4055 4060 4065

Arg Ser Ser Asp Ser Pro Ser Glu Ala Ile Thr Arg Leu Ser Asn
4070 4075 4080

Phe Pro Ala Met Thr Glu Ser Gly Gly Met Ile Leu Ala Met Gln
4085 4090 4095

Thr Ser Pro Pro Gly Ala Thr Ser Ile Ser Ala Pro Thr Leu
4100 4105 4110

Thr Ser Ala Thr Ala Ser Trp Thr Gly Thr Pro Leu Ala Thr Thr
4115 4120 4125

Gln Arg Phe Thr Tyr Ser Glu Lys Thr Thr Leu Phe Ser Lys Gly
4130 4135 4140

Arg Glu Asp Thr Ser Gln Pro Ser Pro Pro Cys Val Glu Glu Thr
4145 4150 4155

Ser Ser Ser Ser Ser Val Val Pro Ile His Ala Thr Thr Ser Pro
4160 4165 4170

Ser Asn Ile Leu Leu Thr Ser Gln Gly His Ser Pro Ser Ser Thr
4175 4180 4185

Pro Pro Val Thr Ser Val Phe Leu Ser Glu Thr Ser Gly Leu
4190 4195 4200

Lys Thr Thr Asp Met Ser Arg Ile Ser Leu Glu Pro Gly Thr Ser
4205 4210 4215

Leu Pro Pro Asn Leu Ser Ser Thr Ala Gly Glu Ala Leu Ser Thr
4220 4225 4230

Tyr Glu Ala Ser Arg Asp Thr Lys Ala Ile His His Ser Ala
4235 4240 4245

Thr Ala Val Thr Asn Met Glu Ala Thr Ser Ser Glu Tyr Ser Pro
4250 4255 4260
Ile Pro Gly His Thr Lys Pro Ser Lys Ala Thr Ser Pro Leu Val
4265 4270 4275

Thr Ser His Ile Met Gly Asp Ile Thr Ser Ser Thr Ser Val Phe
4280 4285 4290

Ser Ser Glu Thr Thr Glu Ile Glu Thr Val Ser Ser Val Asn
4295 4300 4305

Gln Gly Leu Gln Glu Arg Ser Thr Ser Gln Val Ala Ser Ser Ala
4310 4315 4320

Thr Glu Thr Ser Thr Val Ile Thr His Val Ser Ser Gly Asp Ala
4325 4330 4335

Thr Thr His Val Thr Lys Thr Gln Ala Thr Phe Ser Ser Gly Thr
4340 4345 4350

Ser Ile Ser Ser Pro His Gln Phe Ile Thr Ser Thr Asn Thr Phe
4355 4360 4365

Thr Asp Val Ser Thr Asn Pro Ser Thr Ser Leu Ile Met Thr Glu
4370 4375 4380

Ser Ser Gly Val Thr Ile Thr Thr Gln Thr Gly Pro Thr Gly Ala
4385 4390 4395

Ala Thr Gln Gly Pro Tyr Leu Leu Asp Thr Ser Thr Met Pro Tyr
4400 4405 4410

Leu Thr Glu Thr Pro Leu Ala Val Thr Pro Asp Phe Met Gln Ser
4415 4420 4425

Glu Lys Thr Thr Leu Ile Ser Lys Gly Pro Lys Asp Val Thr Trp
4430 4435 4440

Thr Ser Pro Pro Ser Val Ala Glu Thr Ser Tyr Pro Ser Ser Leu
4445 4450 4455

Thr Pro Phe Leu Val Thr Thr Ile Pro Pro Ala Thr Ser Thr Leu
4460 4465 4470

Gln Gly Gln His Thr Ser Ser Pro Val Ser Ala Thr Ser Val Leu
4475 4480 4485

Thr Ser Gly Leu Val Lys Thr Thr Asp Met Leu Asn Thr Ser Met
4490 4495 4500
Glu Pro Val Thr Asn Ser Pro Gln Asn Leu Asn Asn Pro Ser Asn
4505 4510 4515

Glu Ile Leu Ala Thr Leu Ala Ala Thr Thr Asp Ile Glu Thr Ile
4520 4525 4530

His Pro Ser Ile Asn Lys Ala Val Thr Asn Met Gly Thr Ala Ser
4535 4540 4545

Ser Ala His Val Leu His Ser Thr Leu Pro Val Ser Ser Glu Pro
4550 4555 4560

Ser Thr Ala Thr Ser Pro Met Val Pro Ala Ser Ser Met Gly Asp
4565 4570 4575

Ala Leu Ala Ser Ile Ser Ile Pro Gly Ser Glu Thr Thr Asp Ile
4580 4585 4590

Glu Gly Glu Pro Thr Ser Ser Leu Thr Ala Gly Arg Lys Glu Asn
4595 4600 4605

Ser Thr Leu Gln Glu Met Asn Ser Thr Thr Glu Ser Asn Ile Ile
4610 4615 4620

Leu Ser Asn Val Ser Val Gly Ala Ile Thr Glu Ala Thr Lys Met
4625 4630 4635

Glu Val Pro Ser Phe Asp Ala Thr Phe Ile Pro Thr Pro Ala Gln
4640 4645 4650

Ser Thr Lys Phe Pro Asp Ile Phe Ser Val Ala Ser Ser Arg Leu
4655 4660 4665

Ser Asn Ser Pro Pro Met Thr Ile Ser Thr His Met Thr Thr Thr
4670 4675 4680

Gln Thr Gly Ser Ser Gly Ala Thr Ser Lys Ile Pro Leu Ala Leu
4685 4690 4695

Asp Thr Ser Thr Leu Glu Thr Ser Ala Gly Thr Pro Ser Val Val
4700 4705 4710

Thr Glu Gly Phe Ala His Ser Lys Ile Thr Thr Ala Met Asn Asn
4715 4720 4725

Asp Val Lys Asp Val Ser Gln Thr Asn Pro Pro Phe Gln Asp Glu
4730 4735 4740
Ala Ser Ser Pro Ser Ser Gln Ala Pro Val Leu Val Thr Thr Leu
4745 4750 4755

Pro Ser Ser Val Ala Phe Thr Pro Gln Trp His Ser Thr Ser Ser
4760 4765 4770

Pro Val Ser Met Ser Ser Val Leu Thr Ser Ser Leu Val Lys Thr
4775 4780 4785

Ala Gly Lys Val Asp Thr Ser Leu Glu Thr Val Thr Ser Ser Pro
4790 4795 4800

Gln Ser Met Ser Asn Thr Leu Asp Asp Ile Ser Val Thr Ser Ala
4805 4810 4815

Ala Thr Thr Asp Ile Glu Thr Thr His Pro Ser Ile Asn Thr Val
4820 4825 4830

Val Thr Asn Val Gly Thr Thr Gly Ser Ala Phe Glu Ser His Ser
4835 4840 4845

Thr Val Ser Ala Tyr Pro Glu Pro Ser Lys Val Thr Ser Pro Asn
4850 4855 4860

Val Thr Thr Ser Thr Met Glu Asp Thr Thr Ile Ser Arg Ser Ile
4865 4870 4875

Pro Lys Ser Ser Lys Thr Thr Arg Thr Glu Thr Glu Thr Thr Ser
4880 4885 4890

Ser Leu Thr Pro Lys Leu Arg Glu Thr Ser Ile Ser Gln Glu Ile
4895 4900 4905

Thr Ser Ser Thr Glu Thr Ser Thr Val Pro Tyr Lys Glu Leu Thr
4910 4915 4920

Ala Thr Thr Glu Val Ser Arg Thr Asp Val Thr Ser Ser Ser
4925 4930 4935

Ser Thr Ser Phe Pro Gly Pro Asp Gln Ser Thr Val Ser Leu Asp
4940 4945 4950

Ile Ser Thr Glu Thr Asn Thr Arg Leu Ser Thr Ser Pro Ile Met
4955 4960 4965

Thr Glu Ser Ala Glu Ile Thr Ile Thr Thr Gln Thr Gly Pro His
4970 4975 4980
Gly Ala Thr Ser Gln Asp Thr Phe Thr Met Asp Pro Ser Asn Thr
4985 4990 4995

Thr Pro Gln Ala Gly Ile His Ser Ala Met Thr His Gly Phe Ser
5000 5005 5010

Gln Leu Asp Val Thr Thr Leu Met Ser Arg Ile Pro Gln Asp Val
5015 5020 5025

Ser Trp Thr Ser Pro Pro Ser Val Asp Lys Thr Ser Ser Pro Ser
5030 5035 5040

Ser Phe Leu Ser Ser Pro Ala Met Thr Thr Pro Ser Leu Ile Ser
5045 5050 5055

Ser Thr Leu Pro Glu Asp Lys Leu Ser Ser Pro Met Thr Ser Leu
5060 5065 5070

Leu Thr Ser Gly Leu Val Lys Ile Thr Asp Ile Leu Arg Thr
5075 5080 5085

Leu Glu Pro Val Thr Ser Ser Leu Pro Asn Phe Ser Ser Thr Ser
5090 5095 5100

Asp Lys Ile Leu Ala Thr Ser Lys Asp Ser Lys Asp Thr Lys Glu
5105 5110 5115

Ile Phe Pro Ser Ile Asn Thr Glu Glu Thr Asn Val Lys Ala Asn
5120 5125 5130

Asn Ser Gly His Glu Ser His Ser Pro Ala Leu Ala Asp Ser Glu
5135 5140 5145

Thr Pro Lys Ala Thr Thr Gln Met Val Ile Thr Thr Thr Val Gly
5150 5155 5160

Asp Pro Ala Pro Ser Thr Ser Met Pro Val His Gly Ser Ser Glu
5165 5170 5175

Thr Thr Asn Ile Lys Arg Glu Pro Thr Tyr Phe Leu Thr Pro
5180 5185 5190

Leu Arg Glu Thr Ser Thr Ser Gln Glu Ser Ser Phe Pro Thr Asp
5195 5200 5205
Thr Ser Phe Leu Leu Ser Lys Val Pro Thr Gly Thr Ile Thr Glu
5210 5215 5220

Val Ser Ser Thr Gly Val Ile Ser Ser Ser Lys Ile Ser Thr Pro
5225 5230 5235

Asp His Asp Lys Ser Thr Val Pro Pro Asp Thr Phe Thr Gly Glu
5240 5245 5250

Ile Pro Arg Val Phe Thr Ser Ser Ile Lys Thr Lys Ser Ala Glu
5255 5260 5265

Met Thr Ile Thr Thr Gln Ala Ser Pro Pro Glu Ser Ala Ser His
5270 5275 5280

Ser Thr Leu Pro Leu Asp Thr Ser Thr Thr Leu Ser Gln Gly Gly
5285 5290 5295

Thr His Ser Thr Val Ser Gln Gly Phe Pro Tyr Ser Glu Val Thr
5300 5305 5310

Thr Leu Met Gly Met Gly Pro Gly Asn Val Ser Trp Met Thr Thr
5315 5320 5325

Pro Pro Val Glu Glu Thr Ser Ser Val Ser Ser Leu Met Ser Ser
5330 5335 5340

Pro Ala Met Thr Ser Pro Ser Pro Val Ser Ser Thr Ser Pro Gln
5345 5350 5355

Ser Ile Pro Ser Ser Pro Leu Pro Val Thr Ala Leu Pro Thr Ser
5360 5365 5370

Val Leu Val Thr Thr Thr Asp Val Leu Gly Thr Thr Ser Pro Glu
5375 5380 5385

Ser Val Thr Ser Ser Pro Pro Asn Leu Ser Ser Ile Thr His Glu
5390 5395 5400

Arg Pro Ala Thr Tyr Lys Asp Thr Ala His Thr Glu Ala Ala Met
5405 5410 5415

His His Ser Thr Asn Thr Ala Val Thr Asn Val Gly Thr Ser Gly
5420 5425 5430

Ser Gly His Lys Ser Gln Ser Ser Val Leu Ala Asp Ser Glu Thr
5435 5440 5445
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t^S^ '^^^ ^fi"" Leu Met ser Thr Ala Ser Thr Leu G 
5450 5455 5460

Thr Ser Val Ser Thr Ser Thr Pro Asn Ile Ser Gln Thr Asn Gln
5465 5470 5475



"Thr Ala ser Leu Ser Pro Arg Leu Arg Glu 
5480 5485 5490 



f?L "-ys Thr Ser Ser Thr Thr Glu Thr Asn Thr 

5495 5500 5505 



'^^^ ^^f" '^^ ''•'o Gly Ala He Thr Gin Ala Ser 
5510 5515 5520 



Thr Glu Ile Ser Ser Ser Arg Thr Ser Ile Ser Asp Leu Asp Arg
5525 5530 5535

Ser Thr Ile Ala Pro Asp Ile Ser Thr Gly Met Ile Thr Arg Leu
5540 5545 5550

Phe Thr Ser Pro Ile Met Thr Lys Ser Ala Glu Met Thr Val Thr
5555 5560 5565

Thr Gln Thr Thr Thr Pro Gly Ala Thr Ser Gln Gly Ile Leu Pro
5570 5575 5580

Trp Arp Thr Ser Thr Thr Leu Phe Gln Gly Gly Thr His Ser Thr
5585 5590 5595

Val Ser Gln Gly Phe Pro His Ser Glu Ile Thr Thr Leu Arg Ser
5600 5605 5610

Arg Thr Pro Gly Asp Val Ser Trp Met Thr Thr Pro Pro Val Glu
5615 5620 5625



T!JC« S^** ser Leu Met ser pro ser Met Thr Ser 

5630 5635 5640 



Pro Ser Pro Val Ser Ser Thr Ser Pro Glu Ser Ile Pro Ser Ser
5645 5650 5655

Pro Leu Pro Val Thr Ala Leu Leu Thr Ser Val Leu Val Thr Thr
5660 5665 5670

Thr Asn Val Leu Gly Thr Thr Ser Pro Glu Pro Val Thr Ser Ser
5675 5680 5685
Pro Pro Asn Leu Ser Ser Pro Thr Gln Glu Arg Leu Thr Thr Tyr
5690 5695 5700

Asp Thr Ala His Thr Glu Ala Met His Ala Ser Met His Thr
5705 5710 5715

Asn Thr Ala Val Ala Asn Val Gly Thr Ser Ile Ser Gly His Glu
5720 5725 5730

Ser Gln Ser Ser Val Pro Ala Asp Ser His Thr Ser Lys Ala Thr
5735 5740 5745

Ser Pro Met Gly Ile Thr Phe Ala Met Gly Asp Thr Ser Val Tyr
5750 5755 5760

Thr Ser Thr Pro Ala Phe Phe Glu Thr Arg Ile Gln Ser Glu Ser
5765 5770 5775

Thr Ser Ser Leu Ile Pro Gly Leu Arg Asp Thr Arg Thr Ser Glu
5780 5785 5790

Glu Ile Asn Thr Val Thr Glu Thr Ser Thr Val Leu Ser Glu Val
5795 5800 5805

Pro Thr Thr Thr Thr Thr Glu Val Ser Arg Thr Glu Val Ile Thr
5810 5815 5820

Ser Ser Arg Thr Thr Ile Ser Gly Pro Asp His Ser Lys Met Ser
5825 5830 5835

Pro Tyr Ile Ser Thr Glu Thr Ile Thr Arg Leu Ser Thr Phe Pro
5840 5845 5850

Phe Val Thr Gly Ser Thr Glu Met Ala Ile Thr Asn Gln Thr Gly
5855 5860 5865

Pro Ile Gly Thr Ile Ser Gln Ala Thr Leu Thr Leu Asp Thr Ser
5870 5875 5880

Ser Thr Ala Ser Trp Glu Gly Thr His Ser Pro Val Thr Gln Arg
5885 5890 5895

Phe Pro His Ser Glu Glu Thr Thr Thr Met Ser Arg Ser Thr Lys
5900 5905 5910

Gly Val Ser Trp Gln Ser Pro Pro Ser Val Glu Glu Thr Ser Ser
5915 5920 5925

Pro Ser Ser Pro Val Pro Leu Pro Ala Ile Thr Ser His Ser Ser
5930 5935 5940

Leu Tyr Ser Ala Val Ser Gly Ser Ser Pro Thr Ser Ala Leu Pro
5945 5950 5955

Val Thr Ser Leu Leu Thr Ser Gly Arg Arg Lys Thr Ile Asp Met
5960 5965 5970

Leu Asp Thr His Ser Glu Leu Val Thr Ser Ser Leu Pro Ser Ala
5975 5980 5985

Ser Ser Phe Ser Gly Glu Ile Leu Thr Ser Glu Ala Ser Thr Asn
5990 5995 6000

Thr Glu Thr Ile His Phe Ser Glu Asn Thr Ala Glu Thr Asn Met
6005 6010 6015

Gly Thr Thr Asn Ser Met His Lys Leu His Ser Ser Val Ser Ile
6020 6025 6030

His Ser Gln Pro Ser Gly His Thr Pro Pro Lys Val Thr Gly Ser
6035 6040 6045

Met Met Glu Asp Ala Ile Val Ser Thr Ser Thr Pro Gly Ser Pro
6050 6055 6060

Glu Thr Lys Asn Val Asp Arg Asp Ser Thr Ser Pro Leu Thr Pro
6065 6070 6075

Glu Leu Lys Glu Asp Ser Thr Ala Leu Val Met Asn Ser Thr Thr
6080 6085 6090

Glu Ser Asn Thr Val Phe Ser Ser Val Ser Leu Asp Ala Ala Thr
6095 6100 6105

Glu Val Ser Arg Ala Glu Val Thr Tyr Tyr Asp Pro Thr Phe Met
6110 6115 6120

Pro Ala Ser Ala Gln Ser Thr Lys Ser Pro Asp Ile Ser Pro Glu
6125 6130 6135

Ala Ser Ser Ser His Ser Asn Ser Pro Pro Leu Thr Ile Ser Thr
6140 6145 6150
His Lys Thr Ile Ala Thr Gln Thr Gly Pro Ser Gly Val Thr Ser
6155 6160 6165

Leu Gly Gln Leu Thr Leu Asp Thr Ser Thr Ile Ala Thr Ser Ala
6170 6175 6180

Thr Pro Ser Ala Arg Thr Gln Asp Phe Val Asp Ser Glu Thr
6185 6190 6195

Thr Ser Val Met Asn Asn Asp Leu Asn Asp Val Leu Lys Thr Ser
6200 6205 6210

Pro Phe Ser Ala Glu Glu Ala Asn Ser Leu Ser Ser Gln Ala Pro
6215 6220 6225

Leu Leu Val Thr Thr Ser Pro Ser Pro Val Thr Ser Thr Leu Gln
6230 6235 6240

Glu His Ser Thr Ser Ser Leu Val Ser Val Thr Ser Val Pro Thr
6245 6250 6255

Pro Thr Leu Ala Lys Ile Thr Asp Met Asp Thr Asn Leu Glu Pro
6260 6265 6270

Val Thr Arg Ser Pro Gln Asn Leu Arg Asn Thr Leu Ala Thr Ser
6275 6280 6285

Glu Ala Thr Thr Asp Thr His Thr Met His Pro Ser Ile Asn Thr
6290 6295 6300

Ala Met Ala Asn Val Gly Thr Thr Ser Ser Pro Asn Glu Phe Tyr
6305 6310 6315

Phe Thr Val Ser Pro Asp Ser Asp Pro Tyr Lys Ala Thr Ser Ala
6320 6325 6330

Val Val Ile Thr Ser Thr Ser Gly Asp Ser Ile Val Ser Thr Ser
6335 6340 6345

Met Pro Arg Ser Ser Ala Met Lys Lys Ile Glu Ser Glu Thr Thr
6350 6355 6360

Phe Ser Leu Ile Phe Arg Leu Arg Glu Thr Ser Thr Ser Gln Lys
6365 6370 6375

Ile Gly Ser Ser Ser Asp Thr Ser Thr Val Phe Asp Lys Ala Phe
6380 6385 6390
Thr Ala Ala Thr Thr Glu Val Ser Arg Thr Glu Leu Thr Ser Ser
6395 6400 6405

Ser Arg Thr Ser Ile Gln Gly Thr Glu Lys Pro Thr Met Ser Pro
6410 6415 6420

Asp Thr Ser Thr Arg Ser Val Thr Met Leu Ser Thr Phe Ala Gly
6425 6430 6435

Leu Thr Lys Ser Glu Glu Arg Thr Ile Ala Thr Gln Thr Gly Pro
6440 6445 6450

His Arg Ala Thr Ser Gln Gly Thr Leu Thr Trp Asp Thr Ser Ile
6455 6460 6465

Thr Thr Ser Gln Ala Gly Thr His Ser Ala Met Thr His Gly Phe
6470 6475 6480

Ser Gln Leu Asp Leu Ser Thr Leu Thr Ser Arg Val Pro Glu Tyr
6485 6490 6495

Ile Ser Gly Thr Ser Pro Pro Ser Val Glu Lys Thr Ser Ser Ser
6500 6505 6510

Ser Ser Leu Leu Ser Leu Pro Ala Ile Thr Ser Pro Ser Pro Val
6515 6520 6525

Pro Thr Thr Leu Pro Glu Ser Arg Pro Ser Ser Pro Val His Leu
6530 6535 6540

Thr Ser Leu Pro Thr Ser Gly Leu Val Lys Thr Thr Asp Met Leu
6545 6550 6555

Ala Ser Val Ala Ser Leu Pro Pro Asn Leu Gly Ser Thr Ser His
6560 6565 6570

Lys Ile Pro Thr Thr Ser Glu Asp Ile Lys Asp Thr Glu Lys Met
6575 6580 6585

Tyr Pro Ser Thr Asn Ile Ala Val Thr Asn Val Gly Thr Thr Thr
6590 6595 6600

Ser Glu Lys Glu Ser Tyr Ser Ser Val Pro Ala Tyr Ser Glu Pro
6605 6610 6615

Pro Lys Val Thr Ser Pro Met Val Thr Ser Phe Asn Ile Arg Asp
6620 6625 6630
Thr Ile Val Ser Thr Ser Met Pro Gly Ser Ser Glu Ile Thr Arg
6635 6640 6645

Ile Glu Met Glu Ser Thr Phe Ser Leu Ala His Gly Leu Lys Gly
6650 6655 6660

Thr Ser Thr Ser Gln Asp Pro Ile Val Ser Thr Glu Lys Ser Ala
6665 6670 6675

Val Leu His Lys Leu Thr Thr Gly Ala Thr Glu Thr Ser Arg Thr
6680 6685 6690

Glu Val Ala Ser Ser Arg Arg Thr Ser Ile Pro Gly Pro Asp His
6695 6700 6705

Ser Thr Glu Ser Pro Asp Ile Ser Thr Glu Val Ile Pro Ser Leu
6710 6715 6720

Pro Ile Ser Leu Gly Ile Thr Glu Ser Ser Asn Met Thr Ile Ile
6725 6730 6735

Thr Arg Thr Gly Pro Pro Leu Gly Ser Thr Ser Gln Gly Thr Phe
6740 6745 6750

Thr Leu Asp Thr Pro Thr Thr Ser Ser Arg Ala Gly Thr His Ser
6755 6760 6765

Met Ala Thr Gln Glu Phe Pro His Ser Glu Met Thr Thr Val Met
6770 6775 6780

Asn Lys Asp Pro Glu Ile Leu Ser Trp Thr Ile Pro Pro Ser Ile
6785 6790 6795

Glu Lys Thr Ser Phe Ser Ser Ser Leu Met Pro Ser Pro Ala Met
6800 6805 6810

Thr Ser Pro Pro Val Ser Ser Thr Leu Pro Lys Thr Ile His Thr
6815 6820 6825

Thr Pro Ser Pro Met Thr Ser Leu Leu Thr Pro Ser Leu Val Met
6830 6835 6840

Thr Thr Asp Thr Leu Gly Thr Ser Pro Glu Pro Thr Thr Ser Ser
6845 6850 6855

Pro Pro Asn Leu Ser Ser Thr Ser His Val Ile Leu Thr Thr Asp
6860 6865 6870
Glu Asp Thr Thr Ala Ile Glu Ala Met His Pro Ser Thr Ser Thr
6875 6880 6885

Ala Ala Thr Asn Val Glu Thr Thr Cys Ser Gly His Gly Ser Gln
6890 6895 6900

Ser Ser Val Leu Thr Asp Ser Glu Lys Thr Lys Ala Thr Ala Pro
6905 6910 6915

Met Asp Thr Thr Ser Thr Met Gly His Thr Thr Val Ser Thr Ser
6920 6925 6930

Met Ser Val Ser Ser Glu Thr Thr Lys Ile Lys Arg Glu Ser Thr
6935 6940 6945

Tyr Ser Leu Thr Pro Gly Leu Arg Glu Thr Ser Ile Ser Gln Asn
6950 6955 6960

Ala Ser Phe Ser Thr Asp Thr Ser Ile Val Leu Ser Glu Val Pro
6965 6970 6975

Thr Gly Thr Thr Ala Glu Val Ser Arg Thr Glu Val Thr Ser Ser
6980 6985 6990

Gly Arg Thr Ser Ile Pro Gly Pro Ser Gln Ser Thr Val Leu Pro
6995 7000 7005

Glu Ile Ser Thr Arg Thr Met Thr Arg Leu Phe Ala Ser Pro Thr
7010 7015 7020

Met Thr Glu Ser Ala Glu Met Thr Ile Pro Thr Gln Thr Gly Pro
7025 7030 7035

Ser Gly Ser Thr Ser Gln Asp Thr Leu Thr Leu Asp Thr Ser Thr
7040 7045 7050

Thr Lys Ser Gln Ala Lys Thr His Ser Thr Leu Thr Gln Arg Phe
7055 7060 7065

Pro His Ser Glu Met Thr Thr Leu Met Ser Arg Gly Pro Gly Asp
7070 7075 7080

Met Ser Trp Gln Ser Ser Pro Ser Leu Glu Asn Pro Ser Ser Leu
7085 7090 7095
Pro Ser Leu Leu Ser Leu Pro Ala Thr Thr Ser Pro Pro Pro Ile
7100 7105 7110

Ser Ser Thr Leu Pro Val Thr Ile Ser Ser Ser Pro Leu Pro Val
7115 7120 7125

Thr Ser Leu Leu Thr Ser Ser Pro Val Thr Thr Thr Asp Met Leu
7130 7135 7140

His Thr Ser Pro Glu Leu Val Thr Ser Ser Pro Pro Lys Leu Ser
7145 7150 7155

His Thr Ser Asp Glu Arg Leu Thr Thr Gly Lys Asp Thr Thr Asn
7160 7165 7170

Thr Glu Ala Val His Pro Ser Thr Asn Thr Ala Ala Ser Asn Val
7175 7180 7185

Glu Ile Pro Ser Phe Gly His Glu Ser Pro Ser Ser Ala Leu Ala
7190 7195 7200

Asp Ser Glu Thr Ser Lys Ala Thr Ser Pro Met Phe Ile Thr Ser
7205 7210 7215

Thr Gln Glu Asp Thr Thr Val Ala Ile Ser Thr Pro His Phe Leu
7220 7225 7230

Glu Thr Ser Arg Ile Gln Lys Glu Ser Ile Ser Ser Leu Ser Pro
7235 7240 7245

Lys Leu Arg Glu Thr Gly Ser Ser Val Glu Thr Ser Ser Ala Ile
7250 7255 7260

Glu Thr Ser Ala Val Leu Ser Glu Val Ser Ile Gly Ala Thr Thr
7265 7270 7275

Glu Ile Ser Arg Thr Glu Val Thr Ser Ser Ser Arg Thr Ser Ile
7280 7285 7290

Ser Gly Ser Ala Glu Ser Thr Met Leu Pro Glu Ile Ser Thr Thr
7295 7300 7305

Arg Lys Ile Ile Lys Phe Pro Thr Ser Pro Ile Leu Ala Glu Ser
7310 7315 7320

Ser Glu Met Thr Ile Lys Thr Gln Thr Ser Pro Pro Gly Ser Thr
7325 7330 7335
Ser Glu Ser Thr Phe Thr Leu Asp Thr Ser Thr Thr Pro Ser Leu
7340 7345 7350

Val Ile Thr His Ser Thr Met Thr Gln Arg Leu Pro His Ser Glu
7355 7360 7365

Ile Thr Thr Leu Val Ser Arg Gly Ala Gly Asp Val Pro Arg Pro
7370 7375 7380

Ser Ser Leu Pro Val Glu Glu Thr Ser Pro Pro Ser Ser Gln Leu
7385 7390 7395

Ser Leu Ser Ala Met Ile Ser Pro Ser Pro Val Ser Ser Thr Leu
7400 7405 7410

Pro Ala Ser Ser His Ser Ser Ser Ala Ser Val Thr Ser Pro Leu
7415 7420 7425

Thr Pro Gly Gln Val Lys Thr Thr Glu Val Leu Asp Ala Ser Ala
7430 7435 7440

Glu Pro Glu Thr Ser Ser Pro Pro Ser Leu Ser Ser Thr Ser Val
7445 7450 7455

Glu Ile Leu Ala Thr Ser Glu Val Thr Thr Asp Thr Glu Lys Ile
7460 7465 7470

His Pn Phe Pro Asn Thr Ala Val Thr Lys Val Gly Thr Ser Ser
7475 7480 7485

Ser Gly His Glu Ser Pro Ser Ser Val Leu Pro Asp Ser Glu Thr
7490 7495 7500

Thr Lys Ala Thr Ser Ala Met Gly Thr Ile Ser Ile Met Gly Asp
7505 7510 7515

Thr Ser Val Ser Thr Leu Thr Pro Ala Leu Ser Asn Thr Arg Lys
7520 7525 7530

Ile Gln Ser Glu Pro Ala Ser Ser Leu Thr Thr Arg Leu Arg Glu
7535 7540 7545

Thr Ser Thr Ser Glu Glu Thr Ser Leu Ala Thr Glu Ala Asn Thr
7550 7555 7560

Val Leu Ser Lys Val Ser Thr Gly Ala Thr Thr Glu Val Ser Arg
7565 7570 7575
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Thr Glu Ala ne ser Phe ser Arg Thr ser Met Ser Gly Pro Glu 
7580 7585 7590 

Gin ser Thr Met ser Gin Asp He ser He Gly Thr lie Pro Arg 
7595 7600 7605 

He ser Ala ser ser val Leu Thr Glu ser Ala Lys Met Thr lie 
7610 7615 7620 

Thr Thr Gin Thr Gly Pro ser Glu ser Thr Leu Glu Ser Thr Leu 
7625 7630 7635 

Asn Leu Asn Thr Ala Thr Thr Pro ser Trp val Glu Thr His ser 
7640 7645 7650 

He val He Gin Gly Phe Pro His Pro Glu Met Thr Thr ser Met 
7655 7660 7665 

Gly Arg Gly pro Gly Gly val Ser Trp Pro ser Pro Pro Phe val 
7670 7675 7b8U 

Lvs Glu Thr ser Pro Pro ser ser Pro Leu ser Leu Pro Ala Val 
7685 7690 7695 

Thr ser Pro His Pro Val ser Thr Thr Phe Leu Ala His He Pro 
7700 7705 7710 

pro ser Pro Leu Pro val Thr Ser Leu Leu Thr ser Gly Pro Ala 
7715 7720 ' 

Thr Thr Thr Asp He Leu Gly^ Thr ser Thr Glu Pro Gly Thr ser 
7730 7735 ''^^ 

ser ser Ser ser Leu ser Thr Thr Ser His Glu Arg Leu Thr Thr 
7745 7750 77b5 

Tyr Lys Asp Thr Ala His Thr Glu Ala val His Pro ser Thr Asn 
7760 7765 77/U 

Thr Gly Gly Thr Asn val Ala Thr Thr Ser Ser Gly Tyr Lys ser 
7775 7780 7785 

Gin ser ser val Leu Ala Asp ser ser Pro Met cys Thr Thr Ser 
7790 7795 7800 

Thr Met Gly Asp Thr Ser Val Leu Thr Ser Thr Pro Ala Phe Leu
7805 7810 7815

Glu Thr Arg Arg Ile Gln Thr Glu Leu Ala Ser Ser Leu Thr Pro
7820 7825 7830

Gly Leu Arg Glu Ser Ser Gly Ser Glu Gly Thr Ser Ser Gly Thr
7835 7840 7845

Lys Met Ser Thr Val Leu Ser Lys Val Pro Thr Gly Ala Thr Thr
7850 7855 7860

Glu Ile Ser Lys Glu Asp Val Thr Ser Ile Pro Gly Pro Ala Gln
7865 7870 7875

Ser Thr Ile Ser Pro Asp Thr Ser Thr Arg Thr Val Ser Trp Phe
7880 7885 7890

Ser Thr Ser Pro Val Met Thr Glu Ser Ala Glu Ile Thr Met Asn
7895 7900 7905

Thr His Thr Ser Pro Leu Gly Ala Thr Thr Gln Gly Thr Ser Thr
7910 7915 7920

Leu Asp Thr Ser Ser Thr Thr Ser Leu Thr Met Thr His Ser Thr
7925 7930 7935

Ile Ser Gln Gly Phe Ser His Ser Gln Met Ser Thr Leu Met Arg
7940 7945 7950

Gly Pro Glu Asp Val Ser Trp Met Ser Pro Pro Leu Leu Glu
7955 7960 7965

Lys Thr Arg Pro Ser Phe Ser Leu Met Ser Ser Pro Ala Thr Thr
7970 7975 7980

Ser Pro Ser Pro Val Ser Ser Thr Leu Pro Glu Ser Ile Ser Ser
7985 7990 7995

Ser Pro Leu Pro Val Thr Ser Leu Leu Thr Ser Gly Leu Ala Lys
8000 8005 8010

Thr Thr Asp Met Leu His Lys Ser Ser Glu Pro Val Thr Asn Ser
8015 8020 8025



Pro Ala Asn Leu Ser Ser Thr Ser Val Glu Ile Leu Ala Thr Ser
8030 8035 8040

Glu Val Thr Thr Asp Thr Glu Lys Thr His Pro Ser Ser Asn Arg
8045 8050 8055

Thr Val Thr Asp Val Gly Thr Ser Ser Ser Gly His Glu Ser Thr
8060 8065 8070

Ser Phe Val Leu Ala Asp Ser Gln Thr Ser Lys Val Thr Ser Pro
8075 8080 8085

Met Val Ile Thr Ser Thr Met Glu Asp Thr Ser Val Ser Thr Ser
8090 8095 8100

Thr Pro Gly Phe Phe Glu Thr Ser Arg Ile Gln Thr Glu Pro Thr
8105 8110 8115

Ser Ser Leu Thr Leu Gly Leu Arg Lys Thr Ser Ser Ser Glu Gly
8120 8125 8130

Thr Ser Leu Ala Thr Glu Met Ser Thr Val Leu Ser Gly Val Pro
8135 8140 8145

Thr Gly Ala Thr Ala Glu Val Ser Arg Thr Glu Val Thr Ser Ser
8150 8155 8160

Ser Arg Thr Ser Ile Ser Gly Phe Ala Gln Leu Thr Val Ser Pro
8165 8170 8175

Glu Thr Ser Thr Glu Thr Ile Thr Arg Leu Pro Thr Ser Ser Ile
8180 8185 8190

Met Thr Glu Ser Ala Glu Met Met Ile Lys Thr Gln Thr Asp Pro
8195 8200 8205

Pro Gly Ser Thr Pro Glu Ser Thr His Thr Val Asp Ile Ser Thr
8210 8215 8220

Thr Pro Asn Trp Val Glu Thr His Ser Thr Val Thr Gln Arg Phe
8225 8230 8235

Ser His Ser Glu Met Thr Thr Leu Val Ser Arg Ser Pro Gly Asp
8240 8245 8250

Met Leu Trp Pro Ser Gln Ser Ser Val Glu Glu Thr Ser Ser Ala
8255 8260 8265

Ser Ser Leu Leu Ser Leu Pro Ala Thr Thr Ser Pro Ser Pro Val
8270 8275 8280

Ser Ser Thr Leu Val Glu Asp Phe Pro Ser Ala Ser Leu Pro Val
8285 8290 8295

Thr Ser Leu Leu Thr Pro Gly Leu Val Ile Thr Thr Asp Arg Met
8300 8305 8310

Gly Ile Ser Arg Glu Pro Gly Thr Ser Ser Thr Ser Asn Leu Ser
8315 8320 8325

Ser Thr Ser His Glu Arg Leu Thr Thr Leu Glu Asp Thr Val Asp
8330 8335 8340

Thr Glu Ala Met Gln Pro Ser Thr His Thr Ala Val Thr Asn Val
8345 8350 8355

Arg Thr Ser Ile Ser Gly His Glu Ser Gln Ser Ser Val Leu Ser
8360 8365 8370

Asp Ser Glu Thr Pro Lys Ala Thr Ser Ser Met Gly Thr Thr Tyr
8375 8380 8385

Thr Met Gly Glu Thr Ser Val Ser Ile Ser Thr Ser Asp Phe Phe
8390 8395 8400

Glu Thr Ser Arg Val Gln Ile Glu Pro Thr Ser Ser Leu Thr Ser
8405 8410 8415

Gly Leu Arg Glu Thr Ser Ser Ser Glu Arg Ile Ser Ser Ala Thr
8420 8425 8430

Glu Gly Ser Thr Val Leu Ser Glu Val Pro Ser Gly Ala Thr Thr
8435 8440 8445

Glu Val Ser Arg Thr Glu Val Ile Ser Ser Arg Gly Thr Ser Met
8450 8455 8460

Ser Gly Pro Asp Gln Phe Thr Ile Ser Pro Asp Ile Ser Thr Glu
8465 8470 8475

Ala Ile Thr Arg Leu Ser Thr Ser Pro Ile Met Thr Glu Ser Ala
8480 8485 8490

Glu Ser Ala Ile Thr Ile Glu Thr Gly Ser Pro Gly Ala Thr Ser
8495 8500 8505

Glu Gly Thr Leu Thr Leu Asp Thr Ser Thr Thr Thr Phe Trp Ser
8510 8515 8520



Page 236

WO 2004/045553

PCT/US2003/037041

022438.43867 (pct2).ST25.txt



Thr His Ser Thr Ala Ser Pro Gly Phe Ser His Ser Glu Met
8525 8530 8535

Thr Thr Leu Met Ser Arg Thr Pro Gly Asp Val Pro Trp Pro Ser
8540 8545 8550

Leu Pro Ser Val Glu Glu Ala Ser Ser Val Ser Ser Ser Leu Ser
8555 8560 8565

Ser Pro Ala Met Thr Ser Thr Ser Phe Phe Ser Ala Leu Pro Glu
8570 8575 8580

Ser Ile Ser Ser Ser Pro His Pro Val Thr Ala Leu Leu Thr Leu
8585 8590 8595

Gly Pro Val Lys Thr Thr Asp Met Leu Arg Thr Ser Ser Glu Pro
8600 8605 8610

Glu Thr Ser Ser Pro Pro Asn Leu Ser Ser Thr Ser Ala Glu Ile
8615 8620 8625

Leu Ala Thr Ser Glu Val Thr Lys Asp Arg Glu Lys Ile His Pro
8630 8635 8640

Ser Ser Asn Thr Pro Val Val Asn Val Gly Thr Val Ile Tyr Lys
8645 8650 8655

His Leu Ser Pro Ser Ser Val Leu Ala Asp Leu Val Thr Thr Lys
8660 8665 8670

Pro Thr Ser Pro Met Ala Thr Thr Ser Thr Leu Gly Asn Thr Ser
8675 8680 8685

Val Ser Thr Ser Thr Pro Ala Phe Pro Glu Thr Met Met Thr Gln
8690 8695 8700

Pro Thr Ser Ser Leu Thr Ser Gly Leu Arg Glu Ile Ser Thr Ser
8705 8710 8715

Gln Glu Thr Ser Ser Ala Thr Glu Arg Ser Ala Ser Leu Ser Gly
8720 8725 8730

Met Pro Thr Gly Ala Thr Thr Lys Val Ser Arg Thr Glu Ala Leu
8735 8740 8745

Ser Leu Gly Arg Thr Ser Thr Pro Gly Pro Ala Gln Ser Thr Ile
8750 8755 8760

o?2e ^"l" '^^^ ne Thr Arg He ser Thr Pro 

8765 8770 8775 

'^^^ ser Ala Glu Met Thr He Thr pro Lys Thr 
8780 8785 8790 

SiSr ser Gin Gly Thr Phe Thr Leu Asp Thr 

8795 8800 8805 

ilJn "Thr His ser Ala Ala Thr His 

8810 8815 8820 

ISS.. ^ly Met Thr Thr Pro Met Ser Arg Gly pro 

8825 8830 8835 

Arg Pro ser val Glu Lys Thr Ser 
8840 8845 8850 

SE?!. '-^^ ^^"l Ser Leu Ser Ala val Thr ser Pro Ser 

8855 8860 8865 

bSii^ '^y^ ser Thr Pro Ser Glu ser ser His Ser Ser Pro Leu 
8870 8875 8880 

'^^^ Ser Leu Phe Thr Pro Val Met Met Lys Thr Thr asd 
8885 8890 8895 

Met Leu Asp Thr Ser Leu Glu Pro Val Thr Thr Ser Pro Pro Ser
8900 8905 8910 

^5?. '^^^ ser Asp Glu Ser Leu Ala Thr Ser Lys Ala Thr 
8915 8920 8925 

Sly^ '^^^ A""^ ^""^ Leu Ser Glu Asn Thr Ala Val Thr 
8930 8935 8940 

^^y '^^^ ^"l® ser Ala Arg Gin Glu Phe Tyr ser Ser Tyr 
8945 8950 8955 

^l^^ Ser Lys Val Thr ser Pro val Val Thr 

8960 8965 8970 

oS?r "^^^ "-ys Asp He val Ser Thr Thr He Pro Ala ser 
8975 8980 8985

Ser Glu Ile Thr Arg Ile Glu Met Glu Ser Thr Ser Thr Leu Thr
8990 8995 9000

Pro Thr Pro Arg Glu Thr Ser Thr Ser Gln Glu Ile His Ser Ala
9005 9010 9015

Thr Lys Pro Ser Thr Val Pro Tyr Lys Ala Leu Thr Ser Ala Thr
9020 9025 9030

Ile Glu Asp Ser Met Thr Gln Val Met Ser Ser Ser Arg Gly Pro
9035 9040 9045

Ser Pro Asp Gln Ser Thr Met Ser Gln Asp Ile Ser Thr Glu Val
9050 9055 9060

Ile Thr Arg Leu Ser Thr Ser Pro Ile Lys Ala Glu Ser Thr Glu
9065 9070 9075

Met Thr Ile Thr Thr Gln Thr Gly Ser Pro Gly Ala Thr Ser Arg
9080 9085 9090

Gly Thr Leu Thr Leu Asp Thr Ser Thr Thr Phe Met Ser Gly Thr
9095 9100 9105

His Ser Thr Ala Ser Gln Gly Phe Ser His Ser Gln Met Thr Ala
9110 9115 9120

Leu Met Ser Arg Thr Pro Gly Asp Val Pro Trp Leu Ser His Pro
9125 9130 9135

Ser Val Glu Glu Ala Ser Ser Ala Ser Phe Ser Leu Ser Ser Pro
9140 9145 9150

Val Met Thr Ser Ser Ser Pro Val Ser Ser Thr Leu Pro Asp Ser
9155 9160 9165

Ile His Ser Ser Ser Leu Pro Val Thr Ser Leu Leu Thr Ser Gly
9170 9175 9180

Leu Val Lys Thr Thr Glu Leu Leu Gly Thr Ser Ser Glu Pro Glu
9185 9190 9195

Thr Ser Ser Pro Pro Asn Leu Ser Ser Thr Ser Ala Glu Ile Leu
9200 9205 9210

Ala Thr Thr Glu Val Thr Thr Asp Thr Glu Lys Leu Glu Met Thr 
9215 9220 9225

Asn Val Val Thr Ser Gly Tyr Thr His Glu Ser Pro Ser Ser Val
9230 9235 9240

Leu Ala Asp Ser Val Thr Thr Lys Ala Thr Ser Ser Met Gly Ile
9245 9250 9255

Thr Tyr Pro Thr Gly Asp Thr Asn Val Leu Thr Ser Thr Pro Ala
9260 9265 9270

Phe Ser Asp Thr Ser Arg Ile Gln Thr Lys Ser Lys Leu Ser Leu
9275 9280 9285

Thr Pro Gly Leu Met Glu Thr Ser Ile Ser Glu Glu Thr Ser Ser
9290 9295 9300

Ala Thr Glu Lys Ser Thr Val Leu Ser Ser Val Pro Thr Gly Ala
9305 9310 9315

Thr Thr Glu Val Ser Arg Thr Glu Ala Ile Ser Ser Ser Arg Thr
9320 9325 9330

Ser Ile Pro Gly Pro Ala Gln Ser Thr Met Ser Ser Asp Thr Ser
9335 9340 9345

Met Glu Thr Ile Thr Arg Ile Ser Thr Pro Leu Thr Arg Lys Glu
9350 9355 9360

Ser Thr Asp Met Ala Ile Thr Pro Lys Thr Gly Pro Ser Gly Ala
9365 9370 9375

Thr Ser Gln Gly Thr Phe Thr Leu Asp Ser Ser Ser Thr Ala Ser
9380 9385 9390

Trp Pro Gly Thr His Ser Ala Thr Thr Gln Arg Phe Pro Gln Ser
9395 9400 9405

Val Val Thr Thr Pro Met Ser Arg Gly Pro Glu Asp Val Ser Trp
9410 9415 9420

Pro Ser Pro Leu Ser Val Glu Lys Asn Ser Pro Pro Ser Ser Leu
9425 9430 9435

Val Ser Ser Ser Ser Val Thr Ser Pro Ser Pro Leu Tyr Ser Thr
9440 9445 9450



Pro Ser Gly Ser Ser His Ser Ser Pro Val Pro Val Thr Ser Leu
9455 9460 9465

Phe Thr Ser Ile Met Met Lys Ala Thr Asp Met Leu Asp Ala Ser
9470 9475 9480

Leu Glu Pro Glu Thr Thr Ser Ala Pro Asn Met Asn Ile Thr Ser
9485 9490 9495

Asp Glu Ser Leu Ala Thr Ser Lys Ala Thr Thr Glu Thr Glu Ala
9500 9505 9510

Ile His Val Phe Glu Asn Thr Ala Ala Ser His Val Glu Thr Thr
9515 9520 9525

Ser Ala Thr Glu Glu Leu Tyr Ser Ser Ser Pro Gly Phe Ser Glu
9530 9535 9540

Pro Thr Lys Val Ile Ser Pro Val Val Thr Ser Ser Ser Ile Arg
9545 9550 9555

Asp Asn Met Val Ser Thr Thr Met Pro Gly Ser Ser Gly Ile Thr
9560 9565 9570

Arg Ile Glu Ile Glu Ser Met Ser Ser Leu Thr Pro Gly Leu Arg
9575 9580 9585

Glu Thr Arg Thr Ser Gln Asp Ile Thr Ser Ser Thr Glu Thr Ser
9590 9595 9600

Thr Val Leu Tyr Lys Met Ser Ser Gly Ala Thr Pro Glu Val Ser
9605 9610 9615

Thr Glu Val Met Pro Ser Ser Arg Thr Ser Ile Pro Gly Pro
9620 9625 9630

Ala Gln Ser Thr Met Ser Leu Asp Ile Ser Asp Glu Val Val Thr
9635 9640 9645

Arg Leu Ser Thr Ser Pro Ile Met Thr Glu Ser Ala Glu Ile Thr
9650 9655 9660

Ile Thr Thr Gln Thr Gly Tyr Ser Leu Ala Thr Ser Gln Val Thr
9665 9670 9675

Leu Pro Leu Gly Thr Ser Met Thr Phe Leu Ser Gly Thr His Ser
9680 9685 9690

Thr Met Ser Gln Gly Leu Ser His Ser Glu Met Thr Asn Leu Met
9695 9700 9705

QT?n s^"* ■^''P Thr ser Pro Arg Phe Val 

9710 9715 9720 

'^''9 5^'' ser ser ser Leu Thr ser Leu pro Leu Thr 
9725 9730 9735 

s^'' Ser Thr Leu Leu Asp ser Ser pro 
9740 9745 9750 

Ser Ser Pro Leu Pro Val Thr Ser Leu Ile Leu Pro Gly Leu Val
9755 9760 9765 

'-y^ '^^'^ Asp Thr Ser ser Glu Pro Lys Thr ser 

9770 9775 9780 

ser ser Thr Ser val Glu lie Pro Ala Thr 
9785 9790 9795 

tVL '^^ '^^P "-ys lie His Pro Ser Ser Asn 

9800 9805 9810 

'^^^ "-ys Val Arg Thr Ser ser ser Val His Glu ser 
9815 9820 9825 

"■•^ SSC« '^^a Asp Ser Glu Thr Thr He Thr He pro 

9830 9835 9840 

JJf^r Sfif Ala val Asp Asp Thr Thr Val Phe Thr 

9845 9850 9855 

s®'' Asn Pro Ala Phe ser Glu Thr Arg Arg He Pro Thr Glu pro 
9860 9865 9870 

^^"^ Pf'e Arg Glu Thr ser Thr ser Glu 

9875 9880 9885 

X&^« '^'^ Ser He Thr Glu Thr ser Ala Val Leu Tyr Gly Val 
9890 9895 9900 

lil^r Ala Thr Thr Glu val Ser net Thr Glu He Met ser 
9905 9910 9915 

nS5« ^""S '^'^ Asp ser Asp Gin Ser Thr Met ser 

9920 9925 9930

Pro Asp Ile Ile Thr Glu Val Ile Thr Arg Leu Ser Ser Ser Ser
9935 9940 9945

Met Met Ser Glu Ser Thr Gln Met Thr Ile Thr Thr Gln Lys Ser
9950 9955 9960

Ser Pro Gly Ala Thr Ala Gln Ser Thr Leu Thr Leu Ala Thr Thr
9965 9970 9975

Thr Ala Pro Leu Ala Arg Thr His Ser Thr Val Pro Pro Arg Phe
9980 9985 9990

Leu His Ser Glu Met Thr Thr Leu Met Ser Arg Ser Pro Glu Asn
9995 10000 10005

Pro Ser Trp Lys Ser Ser Pro Phe Val Glu Lys Thr Ser Ser Ser
10010 10015 10020

Ser Ser Leu Leu Ser Leu Pro Val Thr Thr Ser Pro Ser Val Ser
10025 10030 10035

Ser Thr Leu Pro Gln Ser Ile Pro Ser Ser Ser Phe Ser Val Thr
10040 10045 10050

Ser Leu Leu Thr Pro Gly Met Val Lys Thr Thr Asp Thr Ser Thr
10055 10060 10065

Glu Pro Gly Thr Ser Leu Ser Pro Asn Leu Ser Gly Thr Ser Val
10070 10075 10080

Glu Ile Leu Ala Ala Ser Glu Val Thr Thr Asp Thr Glu Lys Ile
10085 10090 10095

His Pro Ser Ser Ser Met Ala Val Thr Asn Val Gly Thr Thr Ser
10100 10105 10110

Ser Gly His Glu Leu Tyr Ser Ser Val Ser Ile His Ser Glu Pro
10115 10120 10125

Ser Lys Ala Thr Tyr Pro Val Gly Thr Pro Ser Ser Met Ala Glu
10130 10135 10140

Thr Ser Ile Ser Thr Ser Met Pro Ala Asn Phe Glu Thr Thr Gly
10145 10150 10155

Phe Glu Ala Glu Pro Phe Ser His Leu Thr Ser Gly Phe Arg Lys
10160 10165 10170

Thr Asn Met Ser Leu Asp Thr Ser Ser Val Thr Pro Thr Asn Thr
10175 10180 10185

Pro Ser Ser Pro Gly Ser Thr His Leu Leu Gln Ser Ser Lys Thr
10190 10195 10200

Asp Phe Thr Ser Ser Ala Lys Thr Ser Ser Pro Asp Trp Pro Pro
10205 10210 10215

Ala Ser Gln Tyr Thr Glu Ile Pro Val Asp Ile Ile Thr Pro Phe
10220 10225 10230

Asn Ala Ser Pro Ser Ile Thr Glu Ser Thr Gly Ile Thr Ser Phe
10235 10240 10245

Pro Glu Ser Arg Phe Thr Met Ser Val Thr Glu Ser Thr His His
10250 10255 10260

Leu Ser Thr Asp Leu Leu Pro Ser Ala Glu Thr Ile Ser Thr Gly
10265 10270 10275

Thr Val Met Pro Ser Leu Ser Glu Ala Met Thr Ser Phe Ala Thr
10280 10285 10290

Thr Gly Val Pro Arg Ala Ile Ser Gly Ser Gly Ser Pro Phe Ser
10295 10300 10305

Thr Glu Ser Gly Pro Gly Asp Ala Thr Leu Ser Thr Ile Ala
10310 10315 10320

Glu Ser Leu Pro Ser Ser Thr Pro Val Pro Phe Ser Ser Ser Thr
10325 10330 10335

Phe Thr Thr Thr Asp Ser Ser Thr Ile Pro Ala Leu His Glu Ile
10340 10345 10350

Thr Ser Ser Ser Ala Thr Pro Tyr Arg Val Asp Thr Ser Leu Gly
10355 10360 10365

Thr Glu Ser Ser Thr Thr Glu Gly Arg Leu Val Met Val Ser Thr
10370 10375 10380

Leu Asp Thr Ser Ser Gln Pro Gly Arg Thr Ser Ser Thr Pro Ile
10385 10390 10395



Leu Asp Thr Arg Met Thr Glu Ser Val Glu Leu Gly Thr Val Thr
10400 10405 10410

Ser Ala Tyr Gln Val Pro Ser Leu Ser Thr Arg Leu Thr Arg Thr
10415 10420 10425

Asp Gly Ile Met Glu His Ile Thr Lys Ile Pro Asn Glu Ala Ala
10430 10435 10440

His Arg Gly Thr Ile Arg Pro Val Lys Gly Pro Gln Thr Ser Thr
10445 10450 10455

Ser Pro Ala Ser Pro Lys Gly Leu His Thr Gly Gly Thr Lys Arg
10460 10465 10470

Met Glu Thr Thr Thr Thr Ala Leu Lys Thr Thr Thr Thr Ala Leu
10475 10480 10485

Lys Thr Thr Ser Arg Ala Thr Leu Thr Thr Ser Val Tyr Thr Pro
10490 10495 10500

Thr Leu Gly Thr Leu Thr Pro Leu Asn Ala Ser Arg Gln Met Ala
10505 10510 10515

Ser Thr Ile Leu Thr Glu Met Met Ile Thr Thr Pro Tyr Val Phe
10520 10525 10530

Pro Asp Val Pro Glu Thr Thr Ser Ser Leu Ala Thr Ser Leu Gly
10535 10540 10545

Ala Glu Thr Ser Thr Ala Leu Pro Arg Thr Thr Pro Ser Val Leu
10550 10555 10560

Asn Arg Glu Ser Glu Thr Thr Ala Ser Leu Val Ser Arg Ser Gly
10565 10570 10575

Ala Glu Arg Ser Pro Val Ile Gln Thr Leu Asp Val Ser Ser Ser
10580 10585 10590

Glu Pro Asp Thr Thr Ala Ser Trp Val Ile His Pro Ala Glu Thr
10595 10600 10605

Ile Pro Thr Val Ser Lys Thr Thr Pro Asn Phe Phe His Ser Glu
10610 10615 10620

Leu Asp Thr Val Ser Ser Thr Ala Thr Ser His Gly Ala Asp Val
10625 10630 10635

Ser Ser Ala Ile Pro Thr Asn Ile Ser Pro Ser Glu Leu Asp Ala
10640 10645 10650

Leu Thr Pro Leu Val Thr Ile Ser Gly Thr Asp Thr Ser Thr Thr
10655 10660 10665

Phe Pro Thr Leu Thr Lys Ser Pro His Glu Thr Glu Thr Arg Thr
10670 10675 10680

Thr Trp Leu Thr His Pro Ala Glu Thr Ser Ser Thr Ile Pro Arg
10685 10690 10695

Thr Ile Pro Asn Phe Ser His His Glu Ser Asp Ala Thr Pro Ser
10700 10705 10710

Ile Ala Thr Ser Pro Gly Ala Glu Thr Ser Ser Ala Ile Pro Ile
10715 10720 10725

Met Thr Val Ser Pro Gly Ala Glu Asp Leu Val Thr Ser Gln Val
10730 10735 10740

Thr Ser Ser Gly Thr Asp Arg Asn Met Thr Ile Pro Thr Leu Thr
10745 10750 10755

Leu Ser Pro Gly Glu Pro Lys Thr Ile Ala Ser Leu Val Thr His
10760 10765 10770

Pro Glu Ala Gln Thr Ser Ser Ala Ile Pro Thr Ser Thr Ile Ser
10775 10780 10785

Pro Ala Val Ser Arg Leu Val Thr Ser Met Val Thr Ser Leu Ala
10790 10795 10800

Ala Lys Thr Ser Thr Thr Asn Arg Ala Leu Thr Asn Ser Pro Gly
10805 10810 10815

Glu Pro Ala Thr Thr Val Ser Leu Val Thr His Pro Ala Gln Thr
10820 10825 10830

Ser Pro Thr Val Pro Trp Thr Thr Ser Ile Phe Phe His Ser Lys
10835 10840 10845

Ser Asp Thr Thr Pro Ser Met Thr Thr Ser His Gly Ala Glu Ser
10850 10855 10860

Ser Ser Ala Val Pro Thr Pro Thr Val Ser Thr Glu Val Pro Gly
10865 10870 10875

Val Val Thr Pro Leu Val Thr Ser Ser Arg Ala Val Ile Ser Thr
10880 10885 10890

Thr Ile Pro Ile Leu Thr Leu Ser Pro Gly Glu Pro Glu Thr Thr
10895 10900 10905

Pro Ser Met Ala Thr Ser His Gly Glu Glu Ala Ser Ser Ala Ile
10910 10915 10920

Pro Thr Pro Thr Val Ser Pro Gly Val Pro Gly Val Val Thr Ser
10925 10930 10935

Leu Val Thr Ser Ser Arg Ala Val Thr Ser Thr Thr Ile Pro Ile
10940 10945 10950

Leu Thr Phe Ser Leu Gly Glu Pro Glu Thr Thr Pro Ser Met Ala
10955 10960 10965

Thr Ser His Gly Thr Glu Ala Gly Ser Ala Val Pro Thr Val Leu
10970 10975 10980

Pro Glu Val Pro Gly Met Val Thr Ser Leu Val Ala Ser Ser Arg
10985 10990 10995

Ala Val Thr Ser Thr Thr Leu Pro Thr Leu Thr Leu Ser Pro Gly
11000 11005 11010

Glu Pro Glu Thr Thr Pro Ser Met Ala Thr Ser His Gly Ala Glu
11015 11020 11025

Ala Ser Ser Thr Val Pro Thr Val Ser Pro Glu Val Pro Gly Val
11030 11035 11040

Val Thr Ser Leu Val Thr Ser Ser Ser Gly Val Asn Ser Thr Ser
11045 11050 11055

Ile Pro Thr Leu Ile Leu Ser Pro Gly Glu Leu Glu Thr Thr Pro
11060 11065 11070

Ser Met Ala Thr Ser His Gly Ala Glu Ala Ser Ser Ala Val Pro
11075 11080 11085

Thr Pro Thr Val Ser Pro Gly Val Ser Gly Val Val Thr Pro Leu
11090 11095 11100

Val Thr Ser Ser Arg Ala Val Thr Ser Thr Thr Ile Pro Ile Leu
11105 11110 11115

Thr Leu Ser Ser Ser Glu Pro Glu Thr Thr Pro Ser Met Ala Thr
11120 11125 11130

Ser His Gly Val Glu Ala Ser Ser Ala Val Leu Thr Val Ser Pro
11135 11140 11145

Glu Val Pro Gly Met Val Thr Ser Leu Val Thr Ser Ser Arg Ala
11150 11155 11160

Val Thr Ser Thr Thr Ile Pro Thr Leu Thr Ile Ser Ser Asp Glu
11165 11170 11175

Pro Glu Thr Thr Thr Ser Leu Val Thr His Ser Glu Ala Lys Met
11180 11185 11190

Ile Ser Ala Ile Pro Thr Leu Ala Val Ser Pro Thr Val Gln Gly
11195 11200 11205

Leu Val Thr Ser Leu Val Thr Ser Ser Gly Ser Glu Thr Ser Ala
11210 11215 11220

Phe Ser Asn Leu Thr Val Ala Ser Ser Gln Pro Glu Thr Ile Asp
11225 11230 11235

Ser Trp Val Ala His Pro Gly Thr Glu Ala Ser Ser Val Val Pro
11240 11245 11250

Thr Leu Thr Val Ser Thr Gly Glu Pro Phe Thr Asn Ile Ser Leu
11255 11260 11265

Val Thr His Pro Ala Glu Ser Ser Ser Thr Leu Pro Arg Thr Thr
11270 11275 11280

Ser Arg Phe Ser His Ser Glu Leu Asp Thr Met Pro Ser Thr Val
11285 11290 11295

Thr Ser Pro Glu Ala Glu Ser Ser Ser Ala Ile Ser Thr Thr Ile
11300 11305 11310

Ser Pro Gly Ile Pro Gly Val Leu Thr Ser Leu Val Thr Ser Ser
11315 11320 11325

Gly Arg Asp Ile Ser Ala Thr Phe Pro Thr Val Pro Glu Ser Pro
11330 11335 11340



His Glu Ser Glu Ala Thr Ala Ser Trp Val Thr His Pro Ala Val
11345 11350 11355

Thr Ser Thr Thr Val Pro Arg Thr Thr Pro Asn Tyr Ser His Ser
11360 11365 11370

Glu Pro Asp Thr Thr Pro Ser Ile Ala Thr Ser Pro Gly Ala Glu
11375 11380 11385

Ala Thr Ser Asp Phe Pro Thr Ile Thr Val Ser Pro Asp Val Pro
11390 11395 11400

Asp Met Val Thr Ser Gln Val Thr Ser Ser Gly Thr Asp Thr Ser
11405 11410 11415

Ile Thr Ile Pro Thr Leu Thr Leu Ser Ser Gly Glu Pro Glu Thr
11420 11425 11430

Thr Thr Ser Phe Ile Thr Tyr Ser Glu Thr His Thr Ser Ser Ala
11435 11440 11445

Ile Pro Thr Leu Pro Val Ser Pro Gly Ala Ser Lys Met Leu Thr
11450 11455 11460

Ser Leu Val Ile Ser Ser Gly Thr Asp Ser Thr Thr Thr Phe Pro
11465 11470 11475

Thr Leu Thr Glu Thr Pro Tyr Glu Pro Glu Thr Thr Ala Ile Gln
11480 11485 11490

Leu Ile His Pro Ala Glu Thr Asn Thr Met Val Pro Arg Thr Thr
11495 11500 11505

Pro Lys Phe Ser His Ser Lys Ser Asp Thr Thr Leu Pro Val Ala
11510 11515 11520

Ile Thr Ser Pro Gly Pro Glu Ala Ser Ser Ala Val Ser Thr Thr
11525 11530 11535

Thr Ile Ser Pro Asp Met Ser Asp Leu Val Thr Ser Leu Val Pro
11540 11545 11550

Ser Ser Gly Thr Asp Thr Ser Thr Thr Phe Pro Thr Leu Ser Glu
11555 11560 11565

Thr Pro Tyr Glu Pro Glu Thr Thr Ala Thr Trp Leu Thr His Pro
11570 11575 11580

Ala Glu Thr Ser Thr Thr Val Ser Gly Thr Ile Pro Asn Phe Ser
11585 11590 11595



^r?«« ^^i" A""^ Ser Met val Thr ser pro Gl 

11600 11605 11610 



^fB.. Arg Sep Gly val pro Thr Thr Thr He Pro Pro Ser 
11615 11620 11625 



''■•^^ ser Gin val Thr ser ser Ala Thr Asp 
11630 11635 11640 



f?II.^ "Thr Leu Thr pro ser pro Gly Glu Pro 

11645 11650 11655 



TtfJ!^ '^^^ Ser ser Ala Thr His Pro Gly Thr Gin Thr 
11660 11665 11670 



T?!;,. P''o lie Arg Thr val Pro ser ser Glu pro Asp Thr 
11675 11680 11685 



■^^^ ^32^^ ^^"^ Tf'P Val Thr His pro Pro Gin Thr Ser Thr Pro Val 
11690 11695 11700 



Ser Arg Thr Thr Ser Ser Phe Ser His Ser Ser Pro Asp Ala Thr
11705 11710 11715 



y?L^ Met Ala Thr Ser pro Arg Thr Glu Ala ser ser Ala Val 
11720 11725 11730 



Leu Thr Thr Ile Ser Pro Gly Ala Pro Glu Met Val Thr Ser Gln
11735 11740 11745

Ile Thr Ser Ser Gly Ala Ala Thr Ser Thr Thr Val Pro Thr Leu
11750 11755 11760

Thr His Ser Pro Gly Met Pro Glu Thr Thr Ala Leu Leu Ser Thr
11765 11770 11775

His Pro Arg Thr Glu Thr Ser Lys Thr Phe Pro Ala Ser Thr Val
11780 11785 11790

Phe Pro Gln Val Ser Glu Thr Thr Ala Ser Leu Thr Ile Arg Pro
11795 11800 11805



Gly Ala Glu Thr Ser Thr Ala Leu Pro Thr Gln Thr Thr Ser Ser
11810 11815 11820

Leu Phe Thr Leu Leu Val Thr Gly Thr Ser Arg Val Asp Leu Ser
11825 11830 11835

Pro Thr Ala Ser Pro Gly Val Ser Ala Lys Thr Ala Pro Leu Ser
11840 11845 11850

Thr His Pro Gly Thr Glu Thr Ser Thr Met Ile Pro Thr Ser Thr
11855 11860 11865

Leu Ser Leu Gly Leu Leu Glu Thr Thr Gly Leu Leu Ala Thr Ser
11870 11875 11880

Ser Ser Ala Glu Thr Ser Thr Ser Thr Leu Thr Leu Thr Val Ser
11885 11890 11895

Pro Ala Val Ser Gly Leu Ser Ser Ala Ser Ile Thr Thr Asp Lys
11900 11905 11910

Pro Gln Thr Val Thr Ser Trp Asn Thr Glu Thr Ser Pro Ser Val
11915 11920 11925

Thr Ser Val Gly Pro Pro Glu Phe Ser Arg Thr Val Thr Gly Thr
11930 11935 11940

Thr Met Thr Leu Ile Pro Ser Glu Met Pro Thr Pro Pro Lys Thr
11945 11950 11955

Ser His Gly Glu Gly Val Ser Pro Thr Thr Ile Leu Arg Thr Thr
11960 11965 11970

Met Val Glu Ala Thr Asn Leu Ala Thr Thr Gly Ser Ser Pro Thr
11975 11980 11985

Val Ala Lys Thr Thr Thr Thr Phe Asn Thr Leu Ala Gly Ser Leu
11990 11995 12000

Phe Thr Pro Leu Thr Thr Pro Gly Met Ser Thr Leu Ala Ser Glu
12005 12010 12015

Ser Val Thr Ser Arg Thr Ser Tyr Asn His Arg Ser Trp Ile Ser
12020 12025 12030

Thr Thr Ser Ser Tyr Asn Arg Arg Tyr Trp Thr Pro Ala Thr Ser
12035 12040 12045

Thr Pro Val Thr Ser Thr Phe Ser Pro Gly Ile Ser Thr Ser Ser
12050 12055 12060
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f^S^r 5®'' '^^ '^"'3 -^""a Thr Val pro Phe Met val Pro Phe 
12065 12070 12075 

i-?non '^^ Asn Leu Gin Tyr gIu gIu Asp Met 

12080 12085 12090 

yirt^c '^''9 "-ys Phe Asn Ala Thr gIu Arg gIu Leu 

12095 12100 12105 

"^ys P'"® Leu Phe Apg Asn ser ser Leu Glu Tyr 
12110 12115 12120 

T^r^r ^'"9 "-eu Ala ser Leu Arg Pro Glu Lys Asp 

12125 12130 12135 

"^^^^ '^^^ V*'' Asp Ala He cys Thr His Arg Pro Asp 
12140 12145 12150 

^IVrr '^^P Asp Arg Glu Arg Leu Tyr Trp Glu Leu 

12155 12160 12165 

^!!?,^ """hf Asn Gly lie Gin Glu Leu Gly Pro Tyr Thr Leu 
12170 12175 12180 

^Cfor ■'■y Asn Gly Phe Thr His Arg Ser Ser 

12185 12190 12195 

Thr ser Thr val Asp val Gly 
12200 12205 12210 

fSC,r P''*' S®"" ser Ser pro ser pro Thr Ala Ala Gly 

12215 12220 12225 

VS!i,« "Thr Leu Asn Phe Thr He Thr Asn Leu 

12230 12235 12240 

TJfC.r Asp Met Arg Arg Thr Gly ser Arg Lys Phe Asn 

12245 12250 12255 

S^"" 61 n Gly Leu Leu Lys Pro Leu Phe Lys 

12260 12265 12270 



Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu
12275 12280 12285



lo^on "-y^ Ala Ala Thr Gly val Asp Ala He 

12290 12295 12300

Cys Thr His Arg Leu Asp Pro Lys Ser Pro Gly Leu Asn Arg Glu
12305 12310 12315

Gln Leu Tyr Trp Glu Leu Ser Lys Leu Thr Asn Asp Ile Glu Glu
12320 12325 12330

Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly
12335 12340 12345

Phe Thr His Gln Ser Ser Val Ser Thr Thr Ser Thr Pro Gly Thr
12350 12355 12360

Ser Thr Val Asp Leu Arg Thr Ser Gly Thr Pro Ser Ser Leu Ser
12365 12370 12375

Ser Pro Thr Ile Met Ala Ala Gly Pro Leu Leu Val Pro Phe Thr
12380 12385 12390

Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Gly Glu Asp Met Gly
12395 12400 12405

His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln
12410 12415 12420

Gly Leu Leu Gly Pro Ile Phe Lys Asn Thr Ser Val Gly Pro Leu
12425 12430 12435

Ser Gly Cys Arg Leu Thr Ser Leu Arg Ser Glu Lys Asp Gly
12440 12445 12450

Ala Ala Thr Gly Val Asp Ala Ile Cys Ile His His Leu Asp Pro
12455 12460 12465

Ser Pro Gly Leu Asn Arg Glu Arg Leu Tyr Trp Glu Leu Ser
12470 12475 12480

Gln Leu Thr Asn Gly Ile Lys Glu Leu Gly Pro Tyr Thr Leu Asp
12485 12490 12495

Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr His Arg Thr Ser Val
12500 12505 12510

Pro Thr Ser Ser Thr Pro Gly Thr Ser Thr Val Asp Leu Gly Thr
12515 12520 12525

Ser Gly Thr Pro Phe Ser Leu Pro Ser Pro Ala Thr Ala Gly Pro
12530 12535 12540

VfcAc tSy. Phe Thr He Thr Asn Leu Lys 

12545 12550 12555 

?5?<;n ''^^ "^^ ^Cier ^""y ser Arg Lys Phe Asn Thr 

1^560 12565 12570 

5]c-.c ^""^ II)''- Leu Gly Pro Met Phe Lys Asn 

12575 12580 12585 

'^^'^ ffwn T^r«. <^ly cys Arg Leu Thr Leu Leu 

12590 12595 12600 

'^''^ fffin.: "-y^ '^^P ^""y ^^ly val Asp Ala He Cys 

12605 12610 12615 

yll-)A '^''3 tJ^S, ser Pro Gly Leu Asp Arg Glu Gin 

12620 12625 12630 

T5(C,r ^""P ^'^^ "-eu ser Gin Leu Thr Asn Gly lie Lys Glu Leu 

12635 12640 12645 

?5ficn ^Cir 2^'' "-e" Tyr val Asn Gly Phe 

12650 12655 12660 

i^Lc '^''P fCS, """hr Ser Ser Thr Pro Gly Thr ser 

12665 12670 12675 

YtloA '^^P *^^y ser ser Leu pro Ser Pro 

12680 12685 12690 

lilac '^^^ ^^y tS"^ ^^^'l Pro P'le Thr Leu Asn Phe Thr 

12695 12700 12705 

TiJS^o ^'l" Glu Asp Met His His pro Gly Ser 

12710 12715 12720 

Ul,r '^^ Arg val Leu Gin Gly Leu Leu Gly 

12725 12730 12735 

Pro Met Phe Lys Asn Thr Ser Val Gly Leu Leu Tyr Ser Gly Cys
12740 12745 12750

^''9 Leu Thr Leu Leu Arg ser Glu Lys Asp Gly Ala Ala Thr Gly 

12755 12760 12765

Val Asp Ala Ile Cys Thr His Arg Leu Asp Pro Lys Ser Pro Gly
12770 12775 12780

Val Asp Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr Asn
12785 12790 12795

Ile Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu
12800 12805 12810

Val Asn Gly Phe Thr His Gln Thr Ser Ala Pro Asn Thr Ser
12815 12820 12825

Thr Pro Gly Thr Ser Thr Val Asp Leu Gly Thr Ser Gly Thr Pro
12830 12835 12840

Ser Ser Leu Pro Ser Pro Thr Ser Ala Gly Pro Leu Leu Val Pro
12845 12850 12855

Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp
12860 12865 12870

Met Arg His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val
12875 12880 12885

Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Ser Thr Ser Val Gly
12890 12895 12900

Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Ser Glu Lys
12905 12910 12915

Gly Ala Ala Thr Gly Val Asp Ala Ile Cys Thr His Arg Leu
12920 12925 12930

Asp Pro Lys Ser Pro Gly Val Asp Arg Glu Gln Leu Tyr Trp Glu
12935 12940 12945

Leu Ser Gln Leu Thr Asn Gly Ile Lys Glu Leu Gly Pro Tyr Thr
12950 12955 12960

Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr His Gln Thr
12965 12970 12975

Ser Ala Pro Asn Thr Ser Thr Pro Gly Thr Ser Thr Val Asp Leu
12980 12985 12990

Gly Thr Ser Gly Thr Pro Ser Ser Leu Pro Ser Pro Thr Ser Ala
12995 13000 13005
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?anin ^^"^ flJS "T*"' "-eu Asn phe Thr He Thr Asn 

13010 13015 13020 

?looc ^""^ '^SP i*^" "is Pro Gly Ser Arg Lys Phe 

13025 13030 13035 

T5L« '^^'^ '^'"9 "-eu Gin Gly Leu Leu Gly pro Met Phe 
13040 13045 13050 

""^^ tfScc '^'^ ser Gly Cys Arg Leu Thr 

13055 13060 13065 

ifn^n ^''^ "-y^ ^U,. Ala Ala Thr Gly Met Asp Ala 

13070 13075 13080 

?^n«c """^ ^'"9 ^fR« Lys ser pro Gly Leu Asn Arg 
13085 13090 13095 

"'"y '■'''P Ser Gin Leu Thr His Gly He Lys 

13100 13105 13110 

™e "^^^ Leu Asp Arg Asn ser Leu Tyr val Asn 

13115 13120 13125 

Arg ser ser val Ala Pro Thr Ser Thr Pro Gly 
13130 13135 13140 

Thr Ser Thr Val Asp Leu Gly Thr Ser Gly Thr Pro Ser Ser Leu
13145 13150 13155

?fr^« ^f"" ''^'^ Ala val Pro Leu Leu val Pro phe Thr Leu 
13160 13165 13170 

Phe Thr Ile Thr Asn Leu Gln Tyr Gly Glu Asp Met Arg His
13175 13180 13185 

A""9 Lys Phe Asn Thr Thr Glu Arg val Leu Gin Gly 
13190 13195 13200 

L®" tS" ^ <»ly Leu Phe Lys Asn ser ser val Gly pro Leu Tyr 
13205 13210 13215 

sef" Gly^^ cys Arg Leu He S|r^^ Leu Arg ser Glu L^s^^ Asp Gly Ala 

'^^^ T5o« '^^P cys Thr His His Leu Asn Pro Gin 

13235 13240 13245 




Ser Pro Gly Leu Asp Arg Glu Gln Leu Tyr Trp Gln Leu Ser Gln
13250 13255 13260

Met Thr Asn Gly Ile Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg
13265 13270 13275

Asn Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser Ser Gly Leu
13280 13285 13290

Thr Thr Ser Thr Pro Trp Thr Ser Thr Val Asp Leu Gly Thr Ser
13295 13300 13305

Thr Pro Ser Pro Val Pro Ser Pro Thr Thr Ala Gly Pro Leu
13310 13315 13320

Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr
13325 13330 13335

Glu Glu Asp Met His Arg Pro Gly Ser Arg Lys Phe Asn Ala Thr
13340 13345 13350

Glu Arg Val Leu Gln Gly Leu Leu Ser Pro Ile Phe Lys Asn Ser
13355 13360 13365

Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Ser Leu Arg
13370 13375 13380

Pro Glu Lys Asp Gly Ala Ala Thr Gly Met Asp Ala Val Cys Leu
13385 13390 13395

His Pro Asn Pro Lys Arg Pro Gly Leu Asp Arg Glu Gln Leu
13400 13405 13410

Trp Glu Leu Ser Gln Leu Thr His Asn Ile Thr Glu Leu Gly
13415 13420 13425

Pro Tyr Ser Leu Asp Arg Asp Ser Leu Tyr Val Asn Gly Phe Thr
13430 13435 13440

His Gln Asn Ser Val Pro Thr Thr Ser Thr Pro Gly Thr Ser Thr
13445 13450 13455

Val Tyr Trp Ala Thr Thr Gly Thr Pro Ser Ser Phe Pro Gly His
13460 13465 13470

Thr Glu Pro Gly Pro Leu Leu Ile Pro Phe Thr Phe Asn Phe Thr
13475 13480 13485

Ile Thr Asn Leu His Tyr Glu Glu Asn Met Gln His Pro Gly Ser
13490 13495 13500

Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Lys
13505 13510 13515

Pro Leu Phe Lys Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys
13520 13525 13530

Arg Leu Thr Ser Leu Arg Pro Glu Lys Asp Gly Ala Ala Thr Gly
13535 13540 13545

Met Asp Ala Val Cys Leu Tyr His Pro Asn Pro Lys Arg Pro Gly
13550 13555 13560

Leu Asp Arg Glu Gln Leu Tyr Cys Glu Leu Ser Gln Leu Thr His
13565 13570 13575

Asn Ile Thr Glu Leu Gly Pro Tyr Ser Leu Asp Arg Asp Ser Leu
13580 13585 13590

Tyr Val Asn Gly Phe Thr His Gln Asn Ser Val Pro Thr Thr Ser
13595 13600 13605

Thr Pro Gly Thr Ser Thr Val Tyr Trp Ala Thr Thr Gly Thr Pro
13610 13615 13620

Ser Ser Phe Pro Gly His Thr Glu Pro Gly Pro Leu Leu Ile Pro
13625 13630 13635

Phe Thr Phe Asn Phe Thr Ile Thr Asn Leu His Tyr Glu Glu Asn
13640 13645 13650

Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val
13655 13660 13665

Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Asn Thr Ser Val Gly
13670 13675 13680

Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys
13685 13690 13695

His Glu Ala Ala Thr Gly Val Asp Thr Ile Cys Thr His Arg Val
13700 13705 13710 
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Asp pro lie Gly Pro Gly Leu Asp Arg Giu Arg Leu Tyr Trp Glu 
13715 13720 13725 

Leu Ser Gin Leu Thr Asn Ser lie Thr Glu Leu Gly Pro Tyr Thr 
13730 13735 13740 

Leu Asp Arg Asp ser Leu Tyr val Asn Gly Phe Asn Pro Arg Ser 
13745 13750 13755 

ser Val Pro Thr Thr Ser Thr Pro Gly Thr ser Thr val His Leu 
13760 13765 13770 

Ala Thr Ser Gly Thr Pro ser ser Leu Pro Gly His Thr Ala Pro 
13775 13780 13785 

val Pro Leu Leu lie Pro Phe Thr Leu Asn Phe Thr lie Thr Asn 
13790 13795 13800 

Leu His Tyr Glu Glu Asn Met • Gin His Pro Gly Ser Arg Lys Phe 
13805 13810 13815 

Asn Thr Thr Glu Arg Val Leu Gin Gly Leu Leu Lys Pro Leu Phe 
13820 13825 13830 

Lys Asn Thr ser val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr 
13835 13840 13845 

Leu Leu Arg Pro Glu Lys His Glu Ala Ala Thr Gly val Asp Thr 
13850 13855 13860 

lie Cys Thr His Arg val Asp Pro He Gly Pro Gly Leu Asp Arg 
13865 13870 13875 

Glu Xaa Leu Tyr Trp Glu Leu ser xaa Leu Thr xaa xaa He xaa 
13880 13885 13890 

Glu Leu Gly Pro Tyr Xaa Leu Asp Arg xaa ser Leu Tyr val Asn 
13895 13900 13905 

Gly Phe xaa xaa xaa Xaa xaa Xaa Xaa Xaa Thr ser Thr Pro Gly 
13910 13915 13920 

Thr Ser xaa Val Xaa Leu Xaa Thr ser Gly Thr Pro Xaa Xaa xaa 
13925 13930 13935 

Pro Xaa Xaa Thr Ser Ala Gly pro Leu Leu val Pro Phe Thr Leu 
13940 13945 13950 
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?5Src 'Th'' lie Thr Asn Leu Gin Tyr Glu Glu Asp Met His His 
13955 13960 13965 

?lX,« '^•'9 "-y* Pile Asn Thp Thr Glu Arg val Leu Gin Gly 
13970 13975 13980 

Vfiior Phe Lys Asn Thr Ser Val Gly Leu Leu Tyr 

13985 13990 13995 

^""S "-eu "rhf Leu Leu Arg Pro Glu Lys Asn Gly Ala 
14000 14005 14010 

'^^^ TllLr "et Asp Ala lie Cys ser His Arg Leu Asp Pro Lys 
14015 14020 14025 

'^SP >^''9 Gin Leu Tyr Trp Glu Leu ser Gin 

14030 14035 14040 

l!?]^... ""^^ ^'^y Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg 
14045 14050 14055 

"-e" Asn Gly Phe Thr His Arg Ser ser Val Ala 

14060 14065 14070 

•''"^ Tl}^,^ 'i'^'' ^ly ■'"hr ser Thr val Asp Leu Gly Thr Ser 
14075 14080 14085 

°^y T'}^»^ ser ser Leu Pro ser Pro Thr ilir Ala val Pro Leu 
14090 14095 14100 

y?]„. P*'® Leu Asn Phe Thr He Thr Asn Leu Gin Tyr 
14105 14110 14115 

^^y '^^P '^''9 Sly ser Arg Lys Phe Asn Thr Thr 

14120 14125 14130 

^C?,- "-e" ^1" Gly Leu Leu Gly pro Leu Phe Lys Asn Ser 
14135 14140 14145 

Y?3r« ^ly "-e" "Tyr ser Gly cys Arg Leu He Ser Leu Arg 
14150 14155 14160 

Se*" G]"^^ Lys Asp Gly Ala Ala Thr Gly Val Asp Ala He Cys Thr 
14165 14170 14175 

""•^ yi!o« Gin Ser Pro Gly Leu Asp Arg Glu Gin Leu 

14180 14185 14190

Tyr Trp Gln Leu Ser Gln Met Thr Asn Gly Ile Lys Glu Leu Gly
14195 14200 14205

Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr
14210 14215 14220

His Arg Ser Ser Gly Leu Thr Thr Ser Thr Pro Trp Thr Ser Thr
14225 14230 14235

Val Asp Leu Gly Thr Ser Gly Thr Pro Ser Pro Val Pro Ser Pro
14240 14245 14250

Thr Thr Ala Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr
14255 14260 14265

Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met His Arg Pro Gly Ser
14270 14275 14280

Arg Lys Phe Asn Ala Thr Glu Arg Val Leu Gln Gly Leu Leu Ser
14285 14290 14295

Pro Ile Phe Lys Asn Ser Ser Val Gly Pro Leu Tyr Ser Gly Cys
14300 14305 14310

Arg Leu Thr Ser Leu Arg Pro Glu Lys Asp Gly Ala Ala Thr Gly
14315 14320 14325

Met Asp Ala Val Cys Leu Tyr His Pro Asn Pro Lys Arg Pro Gly
14330 14335 14340

Leu Asp Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr His
14345 14350 14355

Asn Ile Thr Glu Leu Gly Pro Tyr Ser Leu Asp Arg Asp Ser Leu
14360 14365 14370

Tyr Val Asn Gly Phe Thr His Gln Ser Ser Met Thr Thr Thr Arg
14375 14380 14385

Thr Pro Asp Thr Ser Thr Met His Leu Ala Thr Ser Arg Thr Pro
14390 14395 14400

Ala Ser Leu Ser Gly Pro Thr Thr Ala Ser Pro Leu Leu Val Leu
14405 14410 14415

Phe Thr Ile Asn Cys Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp

14420 14425 14430 

^''9 Ser Arg Lys Phe Asn Thr Met Glu ser val 

14435 14440 14445 

^^y "-y^ P''® '-eu Phe Lys Asn Thr ser Val Gly 

14450 14455 14460 

tf^/^c ^^y ^ys Arg Leu Thr Leu Leu Arg Pro Lys Lys 

14465 14470 14475 

'^^^ '^'^ *^''y v*"" Asp Ala lie cys Thr His Arg Leu 
14480 14485 14490 

"-y* ^'ly Leu Asn Arg Glu Gin Leu Tyr Trp Glu 

14495 14500 14505 

ffr,« Lys Leu Thr Asn Asp He Glu Glu Leu Gly pro Tyr Thr 
14510 14515 14520 

Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr His Gln Ser
14525 14530 14535 

y?l.» '^^'^ "''h'' Ser Thr Pro Gly Thr Ser Thr val ASp Leu 
14540 14545 14550 

Arg Thr Ser Gly Thr Pro Ser Ser Leu Ser Ser Pro Thr Ile Met
14555 14560 14565 

Xaa Xaa Xaa Pro Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile
14570 14575 14580 

Leu Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg
14585 14590 14595 

•■y^ Asn Thr Thr Glu Arg val Leu Gin Gly Leu Leu Arg Pro 

14600 14605 14610 

Leu Phe Lys Asn Thr Ser Val Ser Ser Leu Tyr Ser Gly Cys Arg
14615 14620 14625 

T'lL^ ^''9 Pf"** Lys Asp Gly Ala Ala Thr Arg Val 

14630 14635 14640 

^1?.- Ala cys Thr Tyr Arg Pro Asp Pro Lys Ser pro Gly Leu 
14645 14650 14655 

Asp Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr His Ser
14660 14665 14670

Ile Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg Val Ser Leu Tyr
14675 14680 14685

Val Asn Gly Phe Asn Pro Arg Ser Ser Val Pro Thr Thr Ser Thr
14690 14695 14700

Pro Gly Thr Ser Thr Val His Leu Ala Thr Ser Gly Thr Pro Ser
14705 14710 14715

Ser Leu Pro Gly His Thr Xaa Xaa Xaa Pro Leu Leu Xaa Pro Phe
14720 14725 14730

Thr Leu Asn Phe Thr Ile Thr Asn Leu Xaa Tyr Glu Glu Xaa Met
14735 14740 14745

Xaa Xaa Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu
14750 14755 14760

Gln Gly Leu Leu Lys Pro Leu Phe Arg Asn Ser Ser Leu Glu Tyr
14765 14770 14775

Leu Tyr Ser Gly Cys Arg Leu Ala Ser Leu Arg Pro Glu Lys Asp
14780 14785 14790

Ser Ser Ala Met Ala Val Asp Ala Ile Cys Thr His Arg Pro Asp
14795 14800 14805

Pro Glu Asp Leu Gly Leu Asp Arg Glu Arg Leu Tyr Trp Glu Leu
14810 14815 14820

Ser Asn Leu Thr Asn Gly Ile Gln Glu Leu Gly Pro Tyr Thr Leu
14825 14830 14835

Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser Ser
14840 14845 14850

Phe Leu Thr Thr Ser Thr Pro Trp Thr Ser Thr Val Asp Leu Gly
14855 14860 14865

Thr Ser Gly Thr Pro Ser Pro Val Pro Ser Pro Thr Thr Ala Gly
14870 14875 14880

Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu
14885 14890 14895

Gln Tyr Glu Glu Asp Met His Arg Pro Gly Ser Arg Arg Phe Asn
14900 14905 14910



Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Thr Pro Leu Phe Lys
14915 14920 14925



Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu
14930 14935 14940



Leu Arg Pro Glu Lys Gln Glu Ala Ala Thr Gly Val Asp Thr Ile
14945 14950 14955



Cys Thr His Arg Val Asp Pro Ile Gly Pro Gly Leu Asp Arg Glu
14960 14965 14970



Arg Leu Tyr Trp Glu Leu Ser Gln Leu Thr Asn Ser Ile Thr Glu
14975 14980 14985



Leu Gly Pro Tyr Thr Leu Asp Arg Asp Ser Leu Tyr Val Asn Gly
14990 14995 15000



Phe Asn Pro Trp Ser Ser Val Pro Thr Thr Ser Thr Pro Gly Thr
15005 15010 15015



Ser Thr Val His Leu Ala Thr Ser Gly Thr Pro Ser Ser Leu Pro
15020 15025 15030



Gly His Thr Ala Pro Val Pro Leu Leu Ile Pro Phe Thr Leu Asn
15035 15040 15045



Phe Thr Ile Thr Asp Leu His Tyr Glu Glu Asn Met Gln His Pro
15050 15055 15060



Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu
15065 15070 15075



Leu Lys Pro Leu Phe Lys Ser Thr Ser Val Gly Pro Leu Tyr Ser
15080 15085 15090



Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys His Gly Ala Ala
15095 15100 15105



Thr Gly Val Asp Ala Ile Cys Thr Leu Arg Leu Asp Pro Thr Gly
15110 15115 15120



Pro Gly Leu Asp Arg Glu Arg Leu Tyr Trp Glu Leu Ser Gln Leu
15125 15130 15135



Thr Asn Ser Val Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asp
15140 15145 15150



Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser Ser Val Pro Thr
15155 15160 15165



Thr Ser Ile Pro Gly Thr Ser Ala Val His Leu Glu Thr Ser Gly
15170 15175 15180



Thr Pro Ala Ser Leu Pro Gly His Thr Ala Pro Gly Pro Leu Leu
15185 15190 15195



Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu
15200 15205 15210



Glu Asp Met Arg His Pro Gly Ser Arg Lys Phe Ser Thr Thr Glu
15215 15220 15225



Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Asn Thr Ser
15230 15235 15240



Val Ser Ser Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro
15245 15250 15255

Glu Lys Asp Gly Ala Ala Thr Arg Val Asp Ala Val Cys Thr His
15260 15265 15270



Pro Asp Pro Lys Ser Pro Gly Leu Asp Arg Glu Arg Leu Tyr
15275 15280 15285



Lys Leu Ser Gln Leu Thr His Gly Ile Thr Glu Leu Gly Pro
15290 15295 15300



Tyr Thr Leu Asp Arg His Ser Leu Tyr Val Asn Gly Phe Thr His
15305 15310 15315



Gln Ser Ser Met Thr Thr Thr Arg Thr Pro Asp Thr Ser Thr Met
15320 15325 15330



His Leu Ala Thr Ser Arg Thr Pro Ala Ser Leu Ser Gly Pro Thr
15335 15340 15345



Thr Ala Ser Pro Leu Leu Val Leu Phe Thr Ile Asn Phe Thr Ile
15350 15355 15360



Thr Asn Gln Arg Tyr Glu Glu Asn Met His His Pro Gly Ser Arg
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15365 15370 15375 

•■y^ ^'l" Arg val Leu Gin Gly Leu Leu Arg Pro 

15380 15385 15390 

?t!S^. "-ys Asn Thr Ser val Gly pro Leu Tyr ser Gly cys Arg 
15395 15400 15405 

Tt?';!-^ I-®" '^'"9 P""® Lys Lys Asp Gly Ala Ala Thr Lys Val 
15410 15415 15420 

Asp Ala Ile Cys Thr Tyr Arg Pro Asp Pro Lys Ser Pro Gly Leu
15425 15430 15435 

'^^P ^'^^ i-^" Trp Glu Leu ser Gin Leu Thr His Ser 

15440 15445 15450 

Ile Thr Glu Leu Gly Pro Tyr Thr Gln Asp Arg Asp Ser Leu Tyr
15455 15460 15465 

^"1 » ^'if Arg Ser Ser val Pro Thr Thr Ser He 

15470 15475 15480 

Pro Gly Thr Ser Ala Val His Leu Glu Thr Ser Gly Thr Pro Ala
15485 15490 15495

Ser Leu Pro Gly His Thr Ala Pro Gly Pro Leu Leu Val Pro Phe
15500 15505 15510

Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met
15515 15520 15525

Arg His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu
15530 15535 15540

Gln Gly Leu Leu Lys Pro Leu Phe Lys Ser Thr Ser Val Gly Pro
15545 15550 15555

Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys Arg
15560 15565 15570

Gly Ala Ala Thr Gly Val Asp Thr Ile Cys Thr His Arg Leu Asp
15575 15580 15585

Pro Leu Asn Pro Gly Leu Asp Arg Glu Gln Leu Tyr Trp Glu Leu
15590 15595 15600

Ser Lys Leu Thr Arg Gly Ile Ile Glu Leu Gly Pro Tyr Leu Leu
15605 15610 15615

Asp Arg Gly Ser Leu Tyr Val Asn Gly Phe Thr His Arg Thr Ser
15620 15625 15630

Val Pro Thr Thr Ser Thr Pro Gly Thr Ser Thr Val Asp Leu Gly
15635 15640 15645

Thr Ser Gly Thr Pro Phe Ser Leu Pro Ser Pro Ala Xaa Xaa Xaa
15650 15655 15660

Pro Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu
15665 15670 15675

Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn
15680 15685 15690

Thr Thr Glu Arg Val Leu Gln Thr Leu Leu Gly Pro Met Phe Lys
15695 15700 15705

Asn Thr Ser Val Gly Leu Leu Tyr Ser Gly Cys Arg Leu Thr Leu
15710 15715 15720

Leu Arg Ser Glu Lys Asp Gly Ala Ala Thr Gly Val Asp Ala Ile
15725 15730 15735

Cys Thr His Arg Leu Asp Pro Lys Ser Pro Gly Val Asp Arg Glu
15740 15745 15750

Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr Asn Gly Ile Lys Glu
15755 15760 15765

Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly
15770 15775 15780

Phe Thr His Trp Ile Pro Val Pro Thr Ser Ser Thr Pro Gly Thr
15785 15790 15795

Ser Thr Val Asp Leu Gly Ser Gly Thr Pro Ser Leu Pro Ser Ser
15800 15805 15810

Pro Thr Thr Ala Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe
15815 15820 15825

Thr Ile Thr Asn Leu Lys Tyr Glu Glu Asp Met His Cys Pro Gly
15830 15835 15840 
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t^Lc "-y^ "^^^ J^a. '^'•g Val Leu Gin ser Leu Leu 

15845 15850 15855 

'•y* T*^'' ser Val Gly pro Leu Tyr ser Gly 

15860 15865 15870 

Arg Leu Thr Leu Leu Arg Ser Glu Lys Asp Gly Ala Ala Thr 

15875 15880 15885 

YIIqa ^'•9 "-eu Asp Pro Lys Ser Pro 

15890 15895 15900 

YfLc '^^P '^''9 T''P Glu Leu ser Gin Leu Thr 

15905 15910 15915 

5l^t>n "-^^ Thr Leu Asp Arg Asn ser 

15920 15925 15930 



T^Ji,r *^^y ''^'^ ■'■fi'' "is Gin Thr ser Ala pro Asn Thr 

15935 15940 15945 

TSScn "^^"^ s®'" It}'' val Asp Leu Gly Thr ser Gly Thr 

15950 15955 15960 

^''^ ffS^r s®'' S^'' P''o Thr xaa xaa xaa Pro Leu Leu xaa 

15965 15970 15975 

fcLn ''^^ T^i:-. Thr Asn Leu Xaa Tyr Glu Glu 

15980 15985 15990 

??fS«c ^'^y S®'' ^''9 Lys Phe Asn Thr Thr Glu Arg 

15995 16000 16005 

Vfi!,« *^^y Leu xaa Pro Xaa Phe Lys xaa Thr Ser val 

16010 16015 16020 

^^y ?fSnr ^^y Arg Leu Thr Leu Leu Arg xaa Glu 

16025 16030 16035 

'•^^ ^Ln ^"^^ ?IS.r v^"* '^sp xaa xaa cys xaa Xaa Xaa 

16040 16045 16050 

ilRrr ^ Leu Asp Arg Glu Xaa Leu Tyr Trp 

16055 16060 16065 

Vln^n "^^ xaa He xaa Glu Leu Gly Pro Tyr 

16070 16075 16080

Xaa Leu Asp Arg Xaa Ser Leu Tyr Val Asn Gly Phe Thr His Trp
16085 16090 16095



Ile Pro Val Pro Thr Ser Ser Thr Pro Gly Thr Ser Thr Val Asp
16100 16105 16110



Leu Gly Ser Gly Thr Pro Ser Ser Leu Pro Ser Pro Thr Thr Ala
16115 16120 16125



Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn
16130 16135 16140



Leu Lys Tyr Glu Glu Asp Met His Cys Pro Gly Ser Arg Lys Phe
16145 16150 16155



Asn Thr Thr Glu Arg Val Leu Gln Ser Leu Leu Gly Pro Met Phe
16160 16165 16170



Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr
16175 16180 16185



Ser Leu Arg Ser Glu Lys Asp Gly Ala Ala Thr Gly Val Asp Ala
16190 16195 16200



Ile Cys Thr His Arg Val Asp Pro Lys Ser Pro Gly Val Asp Arg
16205 16210 16215

Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr Asn Gly Ile Lys
16220 16225 16230



Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn
16235 16240 16245



Gly Phe Thr His Gln Thr Ser Ala Pro Asn Thr Ser Thr Pro Gly
16250 16255 16260



Thr Ser Thr Val Asp Leu Gly Thr Ser Gly Thr Pro Ser Ser Leu
16265 16270 16275



Pro Ser Pro Thr Ser Ala Gly Pro Leu Leu Val Pro Phe Thr Leu
16280 16285 16290



Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met His His
16295 16300 16305



Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly
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16310 16315 16320 

Leu Leu Gly Pro Met Phe Lys Asn Thr Ser Val Gly Leu Leu Tyr
16325 16330 16335

Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys Asn Gly Ala
16340 16345 16350

Ala Thr Gly Met Asp Ala Ile Cys Thr His Arg Leu Asp Pro Lys
16355 16360 16365

Ser Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa
16370 16375 16380

Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa Leu Asp Arg
16385 16390 16395

Xaa Ser Leu Tyr Val Asn Gly Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa
16400 16405 16410

Xaa Thr Ser Thr Pro Gly Thr Ser Xaa Val Xaa Leu Xaa Thr Ser
16415 16420 16425

Gly Thr Pro Xaa Xaa Xaa Pro Xaa Xaa Thr Xaa Xaa Xaa Pro Leu
16430 16435 16440

Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Xaa Tyr
16445 16450 16455

Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn Thr Thr
16460 16465 16470

Glu Arg Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Arg Asn Ser
16475 16480 16485

Ser Leu Glu Tyr Leu Tyr Ser Gly Cys Arg Leu Ala Ser Leu Arg
16490 16495 16500

Pro Glu Lys Asp Ser Ser Ala Met Ala Val Asp Ala Ile Cys Thr
16505 16510 16515

His Arg Pro Asp Pro Glu Asp Leu Gly Leu Asp Arg Glu Arg Leu
16520 16525 16530

Tyr Trp Glu Leu Ser Asn Leu Thr Asn Gly Ile Gln Glu Leu Gly
16535 16540 16545

Pro Tyr Thr Leu Asp Arg Asn Ser Leu Tyr Val Asn Gly Phe Thr
16550 16555 16560

His Arg Ser Ser Met Pro Thr Thr Ser Thr Pro Gly Thr Ser Thr
16565 16570 16575

Val Asp Val Gly Thr Ser Gly Thr Pro Ser Ser Ser Pro Ser Pro
16580 16585 16590

Thr Thr Ala Gly Pro Leu Leu Ile Pro Phe Thr Leu Asn Phe Thr
16595 16600 16605

Ile Thr Asn Leu Gln Tyr Gly Glu Asp Met Gly His Pro Gly Ser
16610 16615 16620

Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Gly
16625 16630 16635

Pro Ile Phe Lys Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys
16640 16645 16650

Arg Leu Thr Ser Leu Arg Ser Glu Lys Asp Gly Ala Ala Thr Gly
16655 16660 16665

Val Asp Ala Ile Cys Ile His His Leu Asp Pro Lys Ser Pro Gly
16670 16675 16680

Leu Asn Arg Glu Arg Leu Tyr Trp Glu Leu Ser Gln Leu Thr Asn
16685 16690 16695

Gly Ile Lys Glu Leu Gly Pro Tyr Thr Leu Asp Arg Asn Ser Leu
16700 16705 16710

Tyr Val Asn Gly Phe Thr His Arg Thr Ser Val Pro Thr Thr Ser
16715 16720 16725

Thr Pro Gly Thr Ser Thr Val Asp Leu Gly Thr Ser Gly Thr Pro
16730 16735 16740

Phe Ser Leu Pro Ser Pro Ala Thr Ala Gly Pro Leu Leu Val Leu
16745 16750 16755

Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Lys Tyr Glu Glu Asp
16760 16765 16770

Met His Arg Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val
16775 16780 16785 
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9l^^r. ^'^^ ^""y Pro wet Phe Lys Asn Thr ser val Gly 

16790 16795 16800 

Vfoor Gly cys Arg Leu Thr Leu Leu Arg ser Glu Lys 

16805 16810 16815 

$1^/. ^"'y Val Asp Ala He cys Thr His Arg Leu 

16820 16825 16830 

?r2,r ^y^ ^ly asp Arg gIu Xaa Leu Tyr Trp Glu 

16835 16840 16845 

"-^^ '^^^ He xaa Glu Leu Gly pro Tyr Xaa 

16850 16855 16860 

ilS^:c ^^'^ I^^^ ^""y Phe xaa xaa Xaa Xaa 

16865 16870 16875 

'^^^'^ ser Thr Pro Gly Thr Ser Xaa val Xaa Leu 
16880 16885 16890 

^^y "^^^^^ P'^o xaa Pro xaa xaa Thr Xaa Xaa 

16895 16900 16905 

P'^o Phe Thr Leu Asn Phe Thr He Thr Asn 
16910 16915 16920 

^ "'"y*' ^1*^ xaa Met Xaa Xaa Pro Gly ser Arg Lys Phe 
16925 16930 16935

T5J[.^ *^hr Glu Arg val Leu Gin Gly Leu Leu Arg pro Val Phe 
16940 16945 16950 

*-y^ ^513.- r ^al Gly pro Leu Tyr ser Gly cys Arg Leu Thr 

16955 16960 16965 

tfJi^^ P^'o Lys Asp Gly Ala Ala Thr Lys val Asp Ala 
16970 16975 16980 

"'^hr Tyr Arg Pro Asp Pro Lys ser Pro Gly Leu Asp Arg 
16985 16990 16995 

5ll3^« "'"''P ^"l" Leu Ser Gin Leu Thr His ser He Thr 

17000 17005 17010 

ifii^c '^^^ "^hr Gin Asp Arg Asp ser Leu Tyr val Asn 

17015 17020 17025

Gly Phe Thr His Arg Ser Ser Val Pro Thr Thr Ser Ile Pro Gly
17030 17035 17040



Thr Ser Ala Val His Leu Glu Thr Thr Gly Thr Pro Ser Ser Phe
17045 17050 17055



Pro Gly His Thr Glu Pro Gly Pro Leu Leu Ile Pro Phe Thr Phe
17060 17065 17070



Asn Phe Thr Ile Thr Asn Leu Arg Tyr Glu Glu Asn Met Gln His
17075 17080 17085



Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly
17090 17095 17100



Leu Leu Thr Pro Leu Phe Lys Asn Thr Ser Val Gly Pro Leu Tyr
17105 17110 17115



Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys Gln Glu Ala
17120 17125 17130



Ala Thr Gly Val Asp Thr Ile Cys Thr His Arg Val Asp Pro Ile
17135 17140 17145



Pro Gly Leu Asp Arg Glu Arg Leu Tyr Trp Glu Leu Ser Gln
17150 17155 17160



Leu Thr Asn Ser Ile Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg
17165 17170 17175



Ser Leu Tyr Val Asp Gly Phe Asn Pro Trp Ser Ser Val Pro
17180 17185 17190



Thr Thr Ser Thr Pro Gly Thr Ser Thr Val His Leu Ala Thr Ser
17195 17200 17205



Gly Thr Pro Ser Pro Leu Pro Gly His Thr Ala Pro Val Pro Leu
17210 17215 17220



Leu Ile Pro Phe Thr Leu Asn Phe Thr Ile Thr Asp Leu His Tyr
17225 17230 17235



Glu Glu Asn Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr Thr
17240 17245 17250



Glu Arg Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys Ser Thr
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17255 17260 17265 

Y^Un "f^f* ser Gly cys Arg Leu Thr Leu Leu Arg 

17270 17275 17280 

5lii„r "-y^ """^ ^"'y ^''3 Ala Thr Gly val Asp Ala He cys Thr 
17285 17290 17295 

Leu Arg Leu Asp Pro Thr Gly Pro Gly Leu Asp Arg Glu Arg Leu
17300 17305 17310 

HP,. Leu Ser Gin Leu Thr Asn Ser lie Thr Glu Leu Gly 
17315 17320 17325 

T^^,« Leu Asp Arg Asp ser Leu Tyr Val Asn Gly Phe Asn 
17330 17335 17340 

ICS ^ ser Ser Val Pro Thr Thr ser Thr Pro Gly Thr ser Thr 
17345 17350 17355 

SiS,^ Leu Ala Thr ser Gly Thr Pro Ser Ser Leu pro Gly His 
17360 17365 17370 

Leu Leu Val Pro Phe Thr Leu Asn Phe Thr
17375 17380 17385 

TljC--. Leu Lys Tyr Glu Glu Asp Met His Cys pro Gly ser 
17390 17395 17400 

^""^ hX^^^ "''h'' Glu Arg val Leu Gin ser Leu His Gly 

17405 17410 17415 

SI'SS,^ "-ys '^sn Thr ser val Gly Pro Leu Tyr ser Gly cys 
17420 17425 17430 

!rS«r '^^ '^'"9 ser Glu Lys Asp Gly Ala Ala Thr Gly 

17435 17440 17445 

'^'^ Arg Leu Asp Pro Lys ser Pro Gly 

17450 17455 17460 

Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa
17465 17470 17475 

nf-« Leu Gly Pro Tyr Xaa Leu Asp Arg xaa Ser Leu 

17480 17485 17490

Tyr Val Asn Gly Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Ser
17495 17500 17505

Thr Pro Gly Thr Ser Xaa Val Xaa Leu Xaa Thr Ser Gly Thr Pro
17510 17515 17520

Xaa Xaa Xaa Pro Xaa Xaa Thr Xaa Xaa Xaa Pro Leu Leu Xaa Pro
17525 17530 17535

Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Xaa Tyr Glu Glu Xaa
17540 17545 17550

Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val
17555 17560 17565

Leu Gln Gly Leu Leu Xaa Pro Xaa Phe Lys Xaa Thr Ser Val Gly
17570 17575 17580

Xaa Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Xaa Glu Lys
17585 17590 17595

Xaa Xaa Ala Ala Thr Xaa Val Asp Xaa Xaa Cys Xaa Xaa Xaa Xaa
17600 17605 17610

Asp Pro Xaa Xaa Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu
17615 17620 17625

Leu Ser Xaa Leu Thr Asn Ser Ile Thr Glu Leu Gly Pro Tyr Thr
17630 17635 17640

Leu Asp Arg Asp Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser
17645 17650 17655

Ser Met Pro Thr Thr Ser Ile Pro Gly Thr Ser Ala Val His Leu
17660 17665 17670

Glu Thr Ser Gly Thr Pro Ala Ser Leu Pro Gly His Thr Ala Pro
17675 17680 17685

Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn
17690 17695 17700

Leu Gln Tyr Glu Glu Asp Met Arg His Pro Gly Ser Arg Lys Phe
17705 17710 17715

Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Lys Pro Leu Phe
17720 17725 17730 
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■-y* "'■'i'' Ser val Gly pro Leu xyr ser Gly cys Arg Leu Thr 

17735 17740 17745 

■■eu Leu Arg pro Glu Lys Arg Gly Ala Ala Thr Gly val Asp Thr 
17750 17755 17760 

SXl^. '^'"9 "-eu Asp Pro Leu Asn Pro Gly Leu Asp Arg 

17765 17770 17775 

■'"''P Ser xaa Leu Thr xaa xaa He Xaa 

17780 17785 17790 

tSy^r ''■y "-eu Asp Arg xaa Ser Leu Tyr Val Asn 

17795 17800 17805 

*"^y f!3S-» xsa xaa xaa Xaa xaa Thr ser Thr Pro Gly 

17810 17815 17820 

fSr«, "-eu xaa Thr ser Gly Thr Pro xaa xaa Xaa 

17825 17830 17835 

?52.« xaa Thr xaa xaa xaa Pro Leu Leu Xaa pro Phe Thr Leu 
17840 17845 17850 

fbS^^ '^^'^ Leu xaa Tyr Glu Glu xaa Met xaa xaa 

17855 17860 17865 

SlX,^ S®'' '^'"9 "-ys Asn Thr Thr Glu Arg val Leu Gin Gly 
17870 17875 17880 

Leu Leu Xaa Pro Xaa Phe Lys Xaa Thr Ser Val Gly Xaa Leu Tyr
17885 17890 17895 

9lY.^^ ^y^ '^''9 Leu Thr Leu Leu Arg Xaa Glu Lys xaa xaa Ala 
17900 17905 17910 

TilC,, Cys xaa Xaa xaa xaa Asp Pro xaa 

17915 17920 17925 

^ fE2,^ °^y '^''9 Glu Xaa Leu Tyr Trp Glu Leu ser xaa 

17930 17935 17940 

UlLr Leu Gly Pro Tyr Xaa Leu Asp Arg 

17945 17950 17955 

H'ii^^ '^'^ '^S" <^''y Phe His pro Arg ser ser val Pro 
17960 17965 17970

Thr Thr Ser Thr Pro Gly Thr Ser Thr Val His Leu Ala Thr Ser
17975 17980 17985

Gly Thr Pro Ser Ser Leu Pro Gly His Thr Ala Pro Val Pro Leu
17990 17995 18000

Leu Ile Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu His Tyr
18005 18010 18015

Glu Glu Asn Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr Thr
18020 18025 18030

Glu Arg Val Leu Gln Gly Leu Leu Gly Pro Met Phe Lys Asn Thr
18035 18040 18045

Ser Val Gly Leu Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg
18050 18055 18060

Pro Glu Lys Asn Gly Ala Ala Thr Gly Met Asp Ala Ile Cys Ser
18065 18070 18075

His Arg Leu Asp Pro Lys Ser Pro Gly Leu Asp Arg Glu Xaa Leu
18080 18085 18090

Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly
18095 18100 18105

Pro Tyr Xaa Leu Asp Arg Xaa Ser Leu Tyr Val Asn Gly Phe Xaa
18110 18115 18120

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Ser Thr Pro Gly Thr Ser Xaa
18125 18130 18135

Val Xaa Leu Xaa Thr Ser Gly Thr Pro Xaa Xaa Xaa Pro Xaa Xaa
18140 18145 18150

Thr Xaa Xaa Xaa Pro Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr
18155 18160 18165

Ile Thr Asn Leu Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser
18170 18175 18180

Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Xaa
18185 18190 18195

Pro Xaa Phe Lys Xaa Thr Ser Val Gly Xaa Leu Tyr Ser Gly Cys
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18200 18205 18210 

Arg Leu Thr Leu Leu Arg Xaa Glu Lys Xaa Xaa Ala Ala Thr Xaa
18215 18220 18225

Val Asp Xaa Xaa Cys Xaa Xaa Xaa Xaa Asp Pro Xaa Xaa Pro Gly
18230 18235 18240

Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa
18245 18250 18255

Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa Leu Asp Arg Xaa Ser Leu
18260 18265 18270

Tyr Val Asn Gly Phe Thr His Gln Asn Ser Val Pro Thr Thr Ser
18275 18280 18285

Thr Pro Gly Thr Ser Thr Val Tyr Trp Ala Thr Thr Gly Thr Pro
18290 18295 18300

Ser Ser Phe Pro Gly His Thr Glu Pro Gly Pro Leu Leu Ile Pro
18305 18310 18315

Phe Thr Phe Asn Phe Thr Ile Thr Asn Leu His Tyr Glu Glu Asn
18320 18325 18330

Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val
18335 18340 18345

Leu Gln Gly Leu Leu Thr Pro Leu Phe Lys Asn Thr Ser Val Gly
18350 18355 18360

Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys
18365 18370 18375

Gln Glu Ala Ala Thr Gly Val Asp Thr Ile Cys Thr His Arg Val
18380 18385 18390

Asp Pro Ile Gly Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu
18395 18400 18405

Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa
18410 18415 18420

Leu Asp Arg Xaa Ser Leu Tyr Val Asn Gly Phe Xaa Xaa Xaa Xaa
18425 18430 18435

Xaa Xaa Xaa Xaa Thr Ser Thr Pro Gly Thr Ser Xaa Val Xaa Leu
18440 18445 18450

Xaa Thr Ser Gly Thr Pro Xaa Xaa Xaa Pro Xaa Xaa Thr Xaa Xaa
18455 18460 18465

Xaa Pro Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn
18470 18475 18480

Leu Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg Lys Phe
18485 18490 18495

Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Xaa Pro Xaa Phe
18500 18505 18510

Lys Xaa Thr Ser Val Gly Xaa Leu Tyr Ser Gly Cys Arg Leu Thr
18515 18520 18525

Leu Leu Arg Xaa Glu Lys Xaa Xaa Ala Ala Thr Xaa Val Asp Xaa
18530 18535 18540

Xaa Cys Xaa Xaa Xaa Xaa Asp Pro Xaa Xaa Pro Gly Leu Asp Arg
18545 18550 18555

Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa
18560 18565 18570

Glu Leu Gly Pro Tyr Xaa Leu Asp Arg Xaa Ser Leu Tyr Val Asn
18575 18580 18585

Gly Phe Thr His Arg Ser Ser Val Pro Thr Thr Ser Ser Pro Gly
18590 18595 18600

Thr Ser Thr Val His Leu Ala Thr Ser Gly Thr Pro Ser Ser Leu
18605 18610 18615

Pro Gly His Thr Ala Pro Val Pro Leu Leu Ile Pro Phe Thr Leu
18620 18625 18630

Asn Phe Thr Ile Thr Asn Leu His Tyr Glu Glu Asn Met Gln His
18635 18640 18645

Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly
18650 18655 18660

Leu Leu Lys Pro Leu Phe Lys Ser Thr Ser Val Gly Pro Leu Tyr
18665 18670 18675 
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5l^o« '^''9 '^^'^ "-e" Leu Arg Pro Glu Lys His Gly Ala 
18680 18685 18690 

T!!C«. '^sp Ala lie cys Thr Leu Arg Leu Asp Pro Thr 

18695 18700 18705 

Gly Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa
18710 18715 18720

Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa Leu Asp Arg
18725 18730 18735 

» '-^^ ""Vf Val Asn Gly Phe Xaa Xaa xaa Xaa xaa Xaa Xaa 
18740 18745 18750

TllSrr '^^ "Th"" ser Xaa val xaa Leu Xaa Thr ser 

18755 18760 18765 

Gly Thr Pro Xaa Xaa Xaa Pro Xaa Xaa Thr Xaa Xaa Xaa Pro Leu
18770 18775 18780

Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Xaa Tyr
18785 18790 18795

Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn Thr Thr
18800 18805 18810

Glu Arg Val Leu Gln Gly Leu Leu Xaa Pro Xaa Phe Lys Xaa Thr
18815 18820 18825

Ser Val Gly Xaa Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg
18830 18835 18840

Xaa Glu Lys Xaa Xaa Ala Ala Thr Xaa Val Asp Xaa Xaa Cys Xaa
18845 18850 18855

Xaa Xaa Xaa Asp Pro Xaa Xaa Pro Gly Leu Asp Arg Glu Xaa Leu
18860 18865 18870

Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly
18875 18880 18885

Pro Tyr Xaa Leu Asp Arg Xaa Ser Leu Tyr Val Asn Gly Phe Thr
18890 18895 18900

His Arg Thr Ser Val Pro Thr Thr Ser Thr Pro Gly Thr Ser Thr
18905 18910 18915

Val His Leu Ala Thr Ser Gly Thr Pro Ser Ser Leu Pro Gly His
18920 18925 18930



Thr Ala Pro Val Pro Leu Leu Ile Pro Phe Thr Leu Asn Phe Thr
18935 18940 18945



Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met His Arg Pro Gly Ser
18950 18955 18960



Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Ser
18965 18970 18975



Pro Ile Phe Lys Asn Ser Ser Val Gly Pro Leu Tyr Ser Gly Cys
18980 18985 18990

Arg Leu Thr Ser Leu Arg Pro Glu Lys Asp Gly Ala Ala Thr Gly
18995 19000 19005



Met Asp Ala Val Cys Leu Tyr His Pro Asn Pro Lys Arg Pro Gly
19010 19015 19020



Leu Asp Arg Glu Gln Leu Tyr Cys Glu Leu Ser Gln Leu Thr His
19025 19030 19035



Asn Ile Thr Glu Leu Gly Pro Tyr Ser Leu Asp Arg Asp Ser Leu
19040 19045 19050



Val Asn Gly Phe Thr His Gln Asn Ser Val Pro Thr Thr Ser
19055 19060 19065



Thr Pro Gly Thr Ser Thr Val Tyr Trp Ala Thr Thr Gly Thr Pro
19070 19075 19080



Ser Ser Phe Pro Gly His Thr Xaa Xaa Xaa Pro Leu Leu Xaa Pro
19085 19090 19095



Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Xaa Tyr Glu Glu Xaa
19100 19105 19110



Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val
19115 19120 19125



Leu Gln Gly Leu Leu Xaa Pro Xaa Phe Lys Xaa Thr Ser Val Gly
19130 19135 19140



Xaa Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Xaa Glu Lys
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19145 19150 19155 

Xaa Xaa Ala Ala Thr Xaa Val Asp Xaa Xaa Cys Xaa Xaa Xaa Xaa
19160 19165 19170

Asp Pro Xaa Xaa Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu
19175 19180 19185

Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa
19190 19195 19200

Leu Asp Arg Xaa Ser Leu Tyr Val Asn Gly Phe Thr His Trp Ser
19205 19210 19215

Ser Gly Leu Thr Thr Ser Thr Pro Trp Thr Ser Thr Val Asp Leu
19220 19225 19230

Gly Thr Ser Gly Thr Pro Ser Pro Val Pro Ser Pro Thr Thr Ala
19235 19240 19245

Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn
19250 19255 19260

Leu Gln Tyr Glu Glu Asp Met His Arg Pro Gly Ser Arg Lys Phe
19265 19270 19275

Asn Ala Thr Glu Arg Val Leu Gln Gly Leu Leu Ser Pro Ile Phe
19280 19285 19290

Lys Asn Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr
19295 19300 19305

Leu Leu Arg Pro Glu Lys Gln Glu Ala Ala Thr Gly Val Asp Thr
19310 19315 19320

Ile Cys Thr His Arg Val Asp Pro Ile Gly Pro Gly Leu Asp Arg
19325 19330 19335

Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa
19340 19345 19350

Glu Leu Gly Pro Tyr Xaa Leu Asp Arg Xaa Ser Leu Tyr Val Asn
19355 19360 19365

Gly Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Thr Ser Thr Pro Gly
19370 19375 19380

Thr Ser Xaa Val Xaa Leu Xaa Thr Ser Gly Thr Pro Xaa Xaa Xaa
19385 19390 19395

Pro Xaa Xaa Thr Xaa Xaa Xaa Pro Leu Leu Xaa Pro Phe Thr Leu
19400 19405 19410

Asn Phe Thr Ile Thr Asn Leu Xaa Tyr Glu Glu Xaa Met Xaa Xaa
19415 19420 19425

Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly
19430 19435 19440

Leu Leu Xaa Pro Xaa Phe Lys Xaa Thr Ser Val Gly Xaa Leu Tyr
19445 19450 19455

Ser Gly Cys Arg Leu Thr Leu Leu Arg Xaa Glu Lys Xaa Xaa Ala
19460 19465 19470

Ala Thr Xaa Val Asp Xaa Xaa Cys Xaa Xaa Xaa Xaa Asp Pro Xaa
19475 19480 19485

Xaa Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa
19490 19495 19500

Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa Leu Asp Arg
19505 19510 19515

Xaa Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser Phe Gly Leu
19520 19525 19530

Thr Thr Ser Thr Pro Trp Thr Ser Thr Val Asp Leu Gly Thr Ser
19535 19540 19545

Gly Thr Pro Ser Pro Val Pro Ser Pro Thr Thr Ala Gly Pro Leu
19550 19555 19560

Leu Val Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr
19565 19570 19575

Glu Glu Asp Met His Arg Pro Gly Ser Arg Lys Phe Asn Thr Thr
19580 19585 19590

Glu Arg Val Leu Gln Gly Leu Leu Thr Pro Leu Phe Arg Asn Thr
19595 19600 19605

Ser Val Ser Ser Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg
19610 19615 19620 
"-y^ ^""^ ^""^^^ '^^'^ Arg Val Asp Ala Val Cys Thr

19625 19630 19635

""■^ iSLn fSC.r Leu Asp Arg Glu Xaa Leu 

19640 19645 19650 

T«Srr ^^f* "-eu Thr Xaa Xaa He xaa Glu Leu Gly 

19655 19660 19665 

TXr,„ '^sp Arg xaa ser Leu Tyr val Asn Gly Phe Xaa 

19670 19675 19680 

^^31 Th'' Ser Thr Pro Gly Thr ser Xaa 

19685 19690 19695 

Leu Xaa Thr Ser Gly Thr Pro Xaa Xaa Xaa Pro Xaa Xaa

19700 19705 19710 

'^^'^ P""** Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr 

19715 19720 19725 

Leu Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser

19730 19735 19740 

^■"S Thr Thr Glu Arg val Leu Gin Gly Leu Leu Xaa 

19745 19750 19755 

Lys Xaa Thr Ser Val Gly Xaa Leu Tyr Ser Gly Cys

19760 19765 19770 

'^'"9 Leu Leu Arg Xaa Glu Lys xaa Xaa Ala Ala Thr xaa 

19775 19780 19785 

<?5B«« <^ys ^sa xaa xaa Xaa Asp Pro xaa xaa Pro Gly 

19790 19795 19800 

Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa

19805 19810 19815 

ilf^^ Leu Gly Pro Tyr xaa Leu Asp Arg xaa ser Leu 

19820 19825 19830 

'^^ VSL^ P*^® "■^f ""is Trp He Pro Val Pro Thr ser Ser 

19835 19840 19845 

?I12n« Ser Thr val Asp Leu Gly Ser Gly Thr Pro Ser 

19850 19855 19860 
Ser Leu Pro Ser Pro Thr Thr Ala Gly Pro Leu Leu Val Pro Phe
19865 19870 19875

Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Gly Glu Asp Met
19880 19885 19890

Gly His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu
19895 19900 19905

Gln Gly Leu Leu Gly Pro Ile Phe Lys Asn Thr Ser Val Gly Pro
19910 19915 19920

Leu Tyr Ser Gly Cys Arg Leu Thr Ser Leu Arg Ser Glu Lys Asp
19925 19930 19935

Gly Ala Ala Thr Gly Val Asp Ala Ile Cys Ile His His Leu Asp
19940 19945 19950

pro ser P™ Gly L.u Ar^ 6lu xa. Leu Trp Glu Ueu 

Ser Xaa Leu Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa Leu
19970 19975 19980

Asp Arg Xaa Ser Leu Tyr Val Asn Gly Phe Xaa Xaa Xaa Xaa Xaa
19985 19990 19995

Xaa Xaa Xaa Thr Ser Thr Pro Gly Thr Ser Xaa Val Xaa Leu Xaa
20000 20005 20010

Thr Ser Gly Thr Pro Xaa Xaa Xaa Pro Xaa Xaa Thr Xaa Xaa Xaa
20015 20020 20025

Pro Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu
20030 20035 20040

Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn
20045 20050 20055

Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Xaa Pro Xaa Phe Lys
20060 20065 20070

Xaa Thr Ser Val Gly Xaa Leu Tyr Ser Gly Cys Arg Leu Thr Leu
20075 20080 20085

Leu Arg Xaa Glu Lys Xaa Xaa Ala Ala Thr Xaa Val Asp Xaa Xaa
20090 20095 20100

Cys Xaa Xaa Xaa Xaa Asp Pro Xaa Xaa Pro Gly Leu Asp Arg Glu
20105 20110 20115

Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu Thr Xaa Xaa Ile Xaa Glu
20120 20125 20130

Leu Gly Pro Tyr Xaa Leu Asp Arg Xaa Ser Leu Tyr Val Asn Gly
20135 20140 20145

Phe Thr His Gln Thr Phe Ala Pro Asn Thr Ser Thr Pro Gly Thr
20150 20155 20160

Ser Thr Val Asp Leu Gly Thr Ser Gly Thr Pro Ser Ser Leu Pro
20165 20170 20175

Ser Pro Thr Ser Ala Gly Pro Leu Leu Val Pro Phe Thr Leu Asn
20180 20185 20190

Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met His His Pro
20195 20200 20205

Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu
20210 20215 20220

Leu Gly Pro Met Phe Lys Asn Thr Ser Val Gly Leu Leu Tyr Ser
20225 20230 20235

Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys Asn Gly Ala Ala
20240 20245 20250

Thr Arg Val Asp Ala Val Cys Thr His Arg Pro Asp Pro Lys Ser
20255 20260 20265

Pro Gly Leu Asp Arg Glu Xaa Leu Tyr Trp Glu Leu Ser Xaa Leu
20270 20275 20280

Thr Xaa Xaa Ile Xaa Glu Leu Gly Pro Tyr Xaa Leu Asp Arg Xaa
20285 20290 20295

Ser Leu Tyr Val Asn Gly Phe Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa
20300 20305 20310

Thr Ser Thr Pro Gly Thr Ser Xaa Val Xaa Leu Xaa Thr Ser Gly
20315 20320 20325

Thr Pro Xaa Xaa Xaa Pro Xaa Xaa Thr Ala Pro Val Pro Leu Leu
20330 20335 20340

Ile Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu His Tyr Glu
20345 20350 20355

Glu Asn Met Gln His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu
20360 20365 20370

Arg Val Leu Gln Gly Leu Leu Arg Pro Leu Phe Lys Ser Thr Ser
20375 20380 20385

Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro
20390 20395 20400

Glu Lys His Gly Ala Ala Thr Gly Val Asp Ala Ile Cys Thr Leu
20405 20410 20415

Arg Leu Asp Pro Thr Gly Pro Gly Leu Asp Arg Glu Arg Leu Tyr
20420 20425 20430

Trp Glu Leu Ser Gln Leu Thr Asn Ser Val Thr Glu Leu Gly Pro
20435 20440 20445

Tyr Thr Leu Asp Arg Asp Ser Leu Tyr Val Asn Gly Phe Thr Gln
20450 20455 20460

Arg Ser Ser Val Pro Thr Thr Ser Ile Pro Gly Thr Ser Ala Val
20465 20470 20475

His Leu Glu Thr Ser Gly Thr Pro Ala Ser Leu Pro Gly His Thr
20480 20485 20490

Ala Pro Gly Pro Leu Leu Val Pro Phe Thr Leu Asn Phe Thr Ile
20495 20500 20505

Thr Asn Leu Gln Tyr Glu Val Asp Met Arg His Pro Gly Ser Arg
20510 20515 20520

Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Lys Pro
20525 20530 20535

Leu Phe Lys Ser Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg
20540 20545 20550

Leu Thr Leu Leu Arg Pro Glu Lys Arg Gly Ala Ala Thr Gly Val
20555 20560 20565
lJli:,« "^^^ Arg Leu Asp Pro Leu Asn Pro Gly Leu

20570 20575 20580 

'^^P ■'■'•P Leu ser Lys Leu Thr Arg Gly 

20585 20590 20595 

Leu Gly Pro Tyr Leu Leu Asp Arg Gly ser Leu Tyr 
20600 20605 20610 

P*'® Th'' "■'S Arg Asn Phe val Pro He Thr ser Thr 
20615 20620 20625 

Pro Gly Thr Ser Thr Val His Leu Gly Thr Ser Glu Thr Pro Ser
20630 20635 20640 

kfH - ^""^ '^''9 Pi'o iTfi val Pro Gly Pro Leu Leu val pro Phe 
20645 20650 20655 

Thr Leu Asn Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Ala Met
20660 20665 20670

Arg His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu
20675 20680 20685

Gln Gly Leu Leu Arg Pro Leu Phe Lys Asn Thr Ser Ile Gly Pro
20690 20695 20700

IX » 5®'' Arg Leu Thr Leu Leu Arg Pro Glu Lys Asp 

20705 20710 20715 

•^y^ ^l3,» "^^^ '^•'9 Val Asp Ala He Cys Thr His His Pro Asp 
20720 20725 20730 

Pro Gln Ser Pro Gly Leu Asn Arg Glu Gln Leu Tyr Trp Glu Leu
20735 20740 20745

Ser Gln Leu Thr His Gly Ile Thr Glu Leu Gly Pro Tyr Thr Leu
20750 20755 20760

'^^P ^19,. '^^P S®"* Leu lyr val Asp Gly Phe Thr His Trp ser Pro 
20765 20770 20775 

?i;2 ^ '^'^ Pfo Gly Thr ser He Val Asn Leu Gly 

20780 20785 20790 

^^y ''■'o ser Leu Pro Glu Thr Thr xaa xaa Xaa 

20795 20800 20805 
Pro Leu Leu Xaa Pro Phe Thr Leu Asn Phe Thr Ile Thr Asn Leu
20810 20815 20820

Xaa Tyr Glu Glu Xaa Met Xaa Xaa Pro Gly Ser Arg Lys Phe Asn
20825 20830 20835

Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Lys Pro Leu Phe Lys
20840 20845 20850

Ser Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu
20855 20860 20865

Leu Arg Pro Glu Lys Asp Gly Val Ala Thr Arg Val Asp Ala Ile
20870 20875 20880

Cys Thr His Arg Pro Asp Pro Lys Ile Pro Gly Leu Asp Arg Gln
20885 20890 20895

Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr His Ser Ile Thr Glu
20900 20905 20910

Leu Gly Pro Tyr Thr Leu Asp Arg Asp Ser Leu Tyr Val Asn Gly
20915 20920 20925

Phe Thr Gln Arg Ser Ser Val Pro Thr Thr Ser Thr Pro Gly Thr
20930 20935 20940

Phe Thr Val Gln Pro Glu Thr Ser Glu Thr Pro Ser Ser Leu Pro
20945 20950 20955

Gly Pro Thr Ala Thr Gly Pro Val Leu Leu Pro Phe Thr Leu Asn
20960 20965 20970

Phe Thr Ile Thr Asn Leu Gln Tyr Glu Glu Asp Met His Arg Pro
20975 20980 20985

Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu
20990 20995 21000

Leu Met Pro Leu Phe Lys Asn Thr Ser Val Ser Ser Leu Tyr Ser
21005 21010 21015

Gly Cys Arg Leu Thr Leu Leu Arg Pro Glu Lys Asp Gly Ala Ala
21020 21025 21030

Thr Arg Val Asp Ala Val Cys Thr His Arg Pro Asp Pro Lys Ser

21035 21040 21045 

^''^ SlX.^ ^''9 ^'l" Arg Leu Tyr Trp Lys Leu Ser Gln Leu

21050 21055 21060 

Thr His Gly Ile Thr Glu Leu Gly Pro Tyr Thr Leu Asp Arg His
21065 21070 21075

Ser Leu Tyr Val Asn Gly Phe Thr His Gln Ser Ser Met Thr Thr
21080 21085 21090

Thr Arg Thr Pro Asp Thr Ser Thr Met His Leu Ala Thr Ser Arg
21095 21100 21105

Thr Pro Ala Ser Leu Ser Gly Pro Thr Thr Ala Ser Pro Leu Leu
21110 21115 21120

Val Leu Phe Thr Ile Asn Phe Thr Ile Thr Asn Leu Arg Tyr Glu
21125 21130 21135

Glu Asn Met His His Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu
21140 21145 21150

Arg Val Leu Gln Gly Leu Leu Arg Pro Val Phe Lys Asn Thr Ser
21155 21160 21165

Val Gly Pro Leu Tyr Ser Gly Cys Arg Leu Thr Leu Leu Arg Pro
21170 21175 21180

Lys Lys Asp Gly Ala Ala Thr Lys Val Asp Ala Ile Cys Thr Tyr
21185 21190 21195

Arg Pro Asp Pro Lys Ser Pro Gly Leu Asp Arg Glu Gln Leu Tyr
21200 21205 21210

Trp Glu Leu Ser Gln Leu Thr His Ser Ile Thr Glu Leu Gly Pro
21215 21220 21225

Tyr Thr Gln Asp Arg Asp Ser Leu Tyr Asn Val Gly Phe Thr Gln
21230 21235 21240

Arg Ser Ser Val Pro Thr Thr Ser Val Pro Gly Thr Pro Thr Val
21245 21250 21255

Asp Leu Gly Thr Ser Gly Thr Pro Val Ser Lys Pro Gly Pro Ser
21260 21265 21270

Ala Ala Ser Pro Leu Leu Val Leu Phe Thr Leu Asn Gly Thr Ile
21275 21280 21285



Thr Asn Leu Arg Tyr Glu Glu Asn Met Gln His Pro Gly Ser Arg
21290 21295 21300

Phe Asn Thr Thr Glu Arg Val Leu Gln Gly Leu Leu Arg Ser
21305 21310 21315

Leu Phe Lys Ser Thr Ser Val Gly Pro Leu Tyr Ser Gly Cys Arg
21320 21325 21330

Leu Thr Leu Leu Arg Pro Glu Lys Asp Gly Thr Ala Thr Gly Val
21335 21340 21345

Asp Ala Ile Cys Thr His His Pro Asp Pro Lys Ser Pro Arg Leu
21350 21355 21360

Arg Glu Gln Leu Tyr Trp Glu Leu Ser Gln Leu Thr His Asn
21365 21370 21375

Ile Thr Glu Leu Gly His Tyr Ala Leu Asp Asn Asp Ser Leu Phe
21380 21385 21390

Val Asn Gly Phe Thr His Arg Ser Ser Val Ser Thr Thr Ser Thr
21395 21400 21405

Pro Gly Thr Pro Thr Val Tyr Leu Gly Ala Ser Lys Thr Pro Ala
21410 21415 21420

Ser Ile Phe Gly Pro Ser Ala Ala Ser His Leu Leu Ile Leu Phe
21425 21430 21435

Thr Leu Asn Phe Thr Ile Thr Asn Leu Arg Tyr Glu Glu Asn Met
21440 21445 21450

Pro Gly Ser Arg Lys Phe Asn Thr Thr Glu Arg Val Leu Gln
21455 21460 21465

Gly Leu Leu Arg Pro Leu Phe Lys Asn Thr Ser Val Gly Pro Leu
21470 21475 21480

Ser Gly Ser Arg Leu Thr Leu Leu Arg Pro Glu Lys Asp Gly
21485 21490 21495



Glu Ala Thr Gly Val Asp Ala Ile Cys Thr His Arg Pro Asp Pro
21500 21505 21510

Thr Pro Leu Asp Arg Glu Gln Leu Tyr Leu Glu Leu Ser
21515 21520 21525



Gln Leu Thr His Ser Ile Thr Glu Leu Gly Pro Tyr Thr Leu Asp
21530 21535 21540

Arg Asp Ser Leu Tyr Val Asn Gly Phe Thr His Arg Ser Ser Val
21545 21550 21555

Pro Thr Thr Ser Thr Gly Val Val Ser Glu Glu Pro Phe Thr Leu
21560 21565 21570

Asn Phe Thr Ile Asn Asn Leu Arg Tyr Met Ala Asp Met Gly Gln
21575 21580 21585

Pro Gly Ser Leu Lys Phe Asn Ile Thr Asp Asn Val Met Lys His
21590 21595 21600

Leu Leu Ser Pro Leu Phe Gln Arg Ser Ser Leu Gly Ala Arg Tyr
21605 21610 21615

Thr Gly Cys Arg Val Ile Ala Leu Arg Ser Val Lys Asn Gly Ala
21620 21625 21630

Glu Thr Arg Val Asp Leu Leu Cys Thr Tyr Leu Gln Pro Leu Ser
21635 21640 21645

Gly Pro Gly Leu Pro Ile Lys Gln Val Phe His Glu Leu Ser Gln
21650 21655 21660

Gln Thr His Gly Ile Thr Arg Leu Gly Pro Tyr Ser Leu Asp Lys
21665 21670 21675

Asp Ser Leu Tyr Leu Asn Gly Tyr Asn Glu Pro Gly Leu Asp Glu
21680 21685 21690

Pro Pro Thr Thr Pro Lys Pro Ala Thr Thr Phe Leu Pro Pro Leu
21695 21700 21705

Ser Glu Ala Thr Thr Ala Met Gly Tyr His Leu Lys Thr Leu Thr
21710 21715 21720

Leu Asn Phe Thr Ile Ser Asn Leu Gln Tyr Ser Pro Asp Met Gly
21725 21730 21735

Lys Gly Ser Ala Thr Phe Asn Ser Thr Glu Gly Val Leu Gln His
21740 21745 21750

Leu Leu Arg Pro Leu Phe Gln Lys Ser Ser Met Gly Pro Phe Tyr
21755 21760 21765

Leu Gly Cys Gln Leu Ile Ser Leu Arg Pro Glu Lys Asp Gly Ala
21770 21775 21780

Ala Thr Gly Val Asp Thr Thr Cys Thr Tyr His Pro Asp Pro Val
21785 21790 21795

Gly Pro Gly Leu Asp Ile Gln Gln Leu Tyr Trp Glu Leu Ser Gln
21800 21805 21810

Leu Thr His Gly Val Thr Gln Leu Gly Phe Tyr Val Leu Asp Arg
21815 21820 21825

Asp Ser Leu Phe Ile Asn Gly Tyr Ala Pro Gln Asn Leu Ser Ile
21830 21835 21840

Arg Gly Glu Tyr Gln Ile Asn Phe His Ile Val Asn Trp Asn Leu
21845 21850 21855

Ser Asn Pro Asp Pro Thr Ser Ser Glu Tyr Ile Thr Leu Leu Arg
21860 21865 21870

Asp Ile Gln Asp Lys Val Thr Thr Leu Tyr Lys Gly Ser Gln Leu
21875 21880 21885

His Asp Thr Phe Arg Phe Cys Leu Val Thr Asn Leu Thr Met Asp
21890 21895 21900

Ser Val Leu Val Thr Val Lys Ala Leu Phe Ser Ser Asn Leu Asp
21905 21910 21915

Pro Ser Leu Val Glu Gln Val Phe Leu Asp Lys Thr Leu Asn Ala
21920 21925 21930

Ser Phe His Trp Leu Gly Ser Thr Tyr Gln Leu Val Asp Ile His
21935 21940 21945

Val Thr Glu Met Glu Ser Ser Val Tyr Gln Pro Thr Ser Ser Ser
21950 21955 21960

Ser Thr Gln His Phe Tyr Leu Asn Phe Thr Ile Thr Asn Leu Pro
21965 21970 21975

Tyr Ser Gln Asp Lys Ala Gln Pro Gly Thr Thr Asn Tyr Gln Arg
21980 21985 21990

Asn Lys Arg Asn Ile Glu Asp Ala Leu Asn Gln Leu Phe Arg Asn
21995 22000 22005

Ser Ser Ile Lys Ser Tyr Phe Ser Asp Cys Gln Val Ser Thr Phe
22010 22015 22020

Arg Ser Val Pro Asn Arg His His Thr Gly Val Asp Ser Leu Cys
22025 22030 22035

Asn Phe Ser Pro Leu Ala Arg Arg Val Asp Arg Val Ala Ile Tyr
22040 22045 22050

Glu Glu Phe Leu Arg Met Thr Arg Asn Gly Thr Gln Leu Gln Asn
22055 22060 22065

Phe Thr Leu Asp Arg Ser Ser Val Leu Val Asp Gly Tyr Ser Pro
22070 22075 22080

Asn Arg Asn Glu Pro Leu Thr Gly Asn Ser Asp Leu Pro Phe Trp
22085 22090 22095

Ala Val Ile Leu Ile Gly Leu Ala Gly Leu Leu Gly Leu Ile Thr
22100 22105 22110

Cys Leu Ile Cys Gly Val Leu Val Thr Thr Arg Arg Arg Lys Lys
22115 22120 22125

Glu Gly Glu Tyr Asn Val Gln Gln Gln Cys Pro Gly Tyr Tyr Gln
22130 22135 22140

Ser His Leu Asp Leu Glu Asp Leu Gln
22145 22150

<210> 6 

<211> 32 

<212> DNA 

<213> synthetic 

<400> 6 

accggatcca tgggccacac agagcctggc cc 32 

<210> 7 

<211> 29 

<212> DNA 

<213> synthetic 

<400> 7 

tgtaagctta ggcagggagg atggagtcc 29 